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METHODS AND COMPOSITIONS FOR DIAGNOSIS OF HYPERHQMOC YSTE INEMI A 

FIELD OF THE INVENTION 

The invention relates to methods for identifying 
genes which are transcriptionally modulated by the level of 
extracellular homocysteine or other metabolites characteristic 
of hyperhomocystenemia, and/or which correlate with the 
pathological condition of homocysteinemia . The invention 
relates to polynucleotides, polypeptides, diagnostic methods 
and compositions, gene therapy methods and compositions, and 
antibodies, and other related embodiments. The present 
invention provides a novel method for enriching, isolating, 
and identifying polynucleotide sequences of low abundance RNA 
species which are differentially expressed in individuals 
having elevated levels of serum homocysteine. The invention 
provides methods and compositions relating to the fields of 
molecular biology, chemistry, pharmacology, and medical and 
diagnostic technology. 

BACKGROUND AND DESCRIPTION OF RELATED ART 

Homocvsteinemia and Related Pathology 
Cardiovascular disease due to atherosclerosis, and 
arterial and venous thromboembolism are a major cause of 
mortality and morbidity in North America. Although 
conventional risk factors such as hypercholesterolemia, 
hypertension and smoking are associated with increased risk 
for cardiovascular disease, they do not account for all cases. 
There is mounting evidence that elevated blood levels of 
homocysteine (hyperhomocysteinemia) represent a significant 
risk factor for premature vascular and thrombotic disease. 
Hyperhomocysteinemia has been found in approximately 20-30% of 
individuals identified with coronary artery disease, 
peripheral vascular disease and stroke (Boers et al. (1985) N. 
Engl. J. Med. 313 : 709; Clarke et al . (1991) N. Engl. J. Med. 
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324: 1149; Sethub et al . (1995) N. Engl . J. Med. 332 : 286; den 
Heijer et al. (1995) Lancet 345 : 882. Despite the fact that a 
number of inherited or acquired conditions can lead to 
hyperhomocysteinemia, the relationship between increased 
homocysteine levels and vascular disease is present regardless 
of the underlying metabolic causes. 

The majority of inherited cases are believed to 
result from deficiency of the enzyme cystathionine 0- synthase, 
which mediates the condensation of homocysteine with serine to 
form cystathionine. Individuals homozygous for cystathionine 
0- synthase deficiency suffer from ocular, skeletal and 
neurologic complications, and are at high risk for premature 
vascular disease and venous thrombosis (Mudd et al. (1981) Attl. 
J. Hum. Genet. 33: 883; Ueland et al . (1989) J. Lab. Clin. 
Med. 114 : 473; Ueland et al . Plasma h omocysteine and 
cardiovascuar disease, in Atherosclerotic Cardiovascular 
Disease, Hemostasis , and Endothelial Function Francis RB, Ed. 
(1993) Marcel Dekker, Inc., pp. 183-236). The homozygous form 
of cystathionine 0-synthase deficiency is associated with a 9% 
incidence of myocardial infarction, a 38% incidence of 
thromboembolic disease and a 23% incidence of cerebrovascular 
and frequent peripheral vascular events (Grieco et al . (1977) 
Am. J. Med. Sci . 273 : 120) . Despite the autosomal recessive 
inheritance, there now is evidence for an association between 
heterozygosity for cystathionine (3- synthase deficiency and 
premature vascular disease (Clarke et al. (1991) op.cit; Boers 
et al. (1985) op.cit ; Malinow et al. (1989) Circulation 79: 
1180; Kang.et al . (1986) J. Clin. Invest. 77: 1482). Based on 
the frequency of the homozygous condition at birth, it is 
estimated that at least 1-2 percent of the population (not 
including individuals with other different types of acquired 
or inherited abnormalities of homocysteine catabolism) has 
hyperhomocysteinemia and are at risk for premature vascular 
disease and thrombosis. 

Although the identification of homozygous 
cystathionine j8- synthase deficiency can be made on the basis 
of markedly elevated plasma levels of homocysteine, this means 
of diagnosis is inconvenient and is poorly suited for 
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identifying the more common heterozygous state because plasma 
homocysteine levels in heterozygous individuals can overlap 
with those found in healthy individuals (McGill et al . (1990) 
Am , J . Med . Genet . 36 : 45) . This problem can be partially 
circumvented using a methionine -loading test in which L- 
methionine is given orally after an overnight fast. Venous 
blood samples are obtained immediately before and four to 
eight hours after methionine administration for measurement of 
serum non-protein bound homocysteine. Homocysteine levels are 
then determined using ion-exchange chromatography. This 
approach is both time-consuming and uncomfortable for the 
patient and fails to identify all heterozygotes (Boers et al . 
(1985) op . cit : Kang et al. (1986) op. cit ) . Other methods are 
available to identify heterozygotes but these are even more 
problematic. For example, the activity of cystathionine 
synthase can be determined from cultured fibroblasts but this 
requires a skin biopsy, an invasive procedure. Recently, a 
bacterial screening-expression system has been developed to 
identify patients with hyperhomocysteinemia caused exclusively 
by cystathionine iff- synthase deficiency (Kozich and Kraus 
(1992) Hum. Mut . li 113). This system also allows for the 
characterization of the mutation within the cystathionine /S- 
synthase gene. There are, however, at least two major 
disadvantages with this approach. Firstly, it can only be 
used to identify individuals deficient in the enzyme 
cystathionine /3-synthase. Therefore, patients with other 
forms of genetic or acquired conditions which predispose them 
to hyperhomocysteinemia cannot be identified. Secondly, 
allelic and genetic heterogeneity within the cystathionine (3- 
synthase gene complicates the reliable use of this approach. 

At present, there is no simple and reliable test to 
identify individuals with hyperhomocysteinemia. This is a 
serious drawback because some forms of hyperhomocysteinemia 
can often be corrected by vitamin supplementation (Ubbink et 
al. (1993) Clin. Invest. 71 ; 993; Franken et al . (1994) 
Arteroscler . Thromb . 14: 465. Since such treatment results in 
only minor side effects, it is important to identify the 
individuals at risk for hyperhomocysteinemia so that they can 
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be targeted for treatment aimed at reducing plasma 
homocysteine, thereby reducing the risk of vascular disease. 
Furthermore a simple diagnostic test would provide a method to 
monitor the success of such treatment and its impact on 
subsequent atherosclerosis . 

Despite intensive study the exact mechanisms 
responsible for homocysteine -induced vascular disease remain 
unclear. However, several recent reports have shown that 
homocysteine contributes to endothelial cell damage and 
dysfunction in vitro . These include induction of a protease 
activator of Factor V (Rodgers and Kane (1986) J. Clin. 
Invest . 77: 1909) , induction of an inhibitor of protein C 
activation (Rodgers and Conn (1990) Blood 75: 895), aberrant 
processing and secretion of thrombomodulin (Lentz and Sadler 
(1991) j. Clin. Invest. 88: 1906) and von Willebrand Factor 
(Lentz and Sadler (1993) Blood 81: 683), and inhibition of 
thrombomodulin cofactor activity (Hayashi et al. (1992) Blopd 
79 ; 2930) . Alteratively , homocysteine may promote the 
proliferation of vascular smooth muscle cells, a major 
component in atherosclerotic plaque, while decreasing 
endothelial cell proliferation (Tsai et al. (1994) Proc. Natl. 
Acad. Sci. (U.S.A. ) 91; 6369) . Marked platelet accumulation at 
sites of vascular injury and platelet rich occlusive thrombi 
are also distinctive pathologic features of both human and 
experimental hyperhomocysteinemia (James et al . (1990) J . Am. 
Coll. Cardiol. 15: 763; Harker et al . (1983) Circ . Res. 53: 
731). In addition, platelet survival appears to be shortened 
in patients with hyperhomocysteinemia. However, the 
mechanisms involved in homocysteine -induced platelet 
dysfunction also remain elusive. 

The diagnosis of cardiovascular disease 
requires a detailed clinical evaluation, and diagnosis 
generally cannot be made until significant symptoms of 
coronary artery occlusion and plaque formation are clinically 
apparent. Thus, there is a need for a simple and reliable 
diagnostic method for hyperhomocysteinemia, which is 
associated with a substantial fraction of atherosclerotic 
disease patients, and for methods to use such tests to 
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identify patients with hyperhomocysteinemia and monitor the 
effect of pharmacologic interventions on homocysteine- induced 
pathology. 

5 Enrichment Cloning Methods and Differential RNA Display 

Selective amplification of polynucleotides 
represents a major research goal of molecular biology, with 
particular importance in diagnostic and forensic applications, 
as well as for general manipulations of genetic materials and 
10 laboratory reagents. 

Comparisons of gene expression in different cell 
types or in a single cell type under different conditions 
provides a basis for analyzing the underlying biological 
processes controlling cell differentiation and metabolism, and 
15 can be used as a pathognomonic indicator of a pathological 
condition or predisposition to developing a pathological 
condition . 

One way to compare gene expression between two cell 
populations in vitro or in vivo is to identify mRNA species 

20 which are differentially expressed between the cell 

populations (i.e., present at different abundances between the 
cell populations) . Current methods that distinguish mRNAs in 
comparative studies largely rely on subtractive hybridization 
(Lee et al. (1991) Proc. Natl. Acad. Sci. (U.S.A. ) 88.: 2825) 

25 or differential display employing arbitrary primer polymerase 
chain reaction (PCR) (Liang and Pardee (1992) Science 257 : 
967) . Each of these methods has been used by various 
investigators to identify differentially expressed mRNA 
species, including mRNAs which have increased or decreased 

30 abundance in cancer cells (Salesiotis et al . (1995) Cancer 

Lett . 91 : 47; Jiang et al . (1995) Oncogene 10: 1855; Blok et 
al . (1995) Prostate 26 : 213; Shinoura et al . (1995) Cancer 
Lett . 89: 215; Murphy et al . (1993) Cell Growth Differ 4: 715;. 
Austruy et al . (1993) Cancer Res. 53: 2888; Zhang et al . 

35 (1993) Mol. Carcinoa. 8: 123; and Liang et al . (1992) Cancer 
Res . 52 : 6966) . The methods have also been used to identify 
mRNA species which are induced or repressed by drugs or 
certain nutrients (Fisicaro et al . (1995) Mol . Immunol . 32 : 
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565; Chapman et al . (1995) Mol . Cell . Endocrinol. 108: 108; 
Douglass et al - (1995) J. Neurosci . 15: 2471; Aiello et al. 

(1994) Proc. Natl. Acad. Sci. (U.S.A. ) 91: 6231; Ace et al . 

(1994) Endocrinology 134 : 1305. However, there are no reports 
of these methods being used to identify mRNA species which are 
differentially expressed in cells exposed to 

hyperphysiological levels of homocysteine either in vitro or 
in vivo . 

Since no in vitro biochemical diagnostic test is 
presently available for rapidly and conveniently diagnosing 
hyperhomocysteinemia in its early stages and for identifying 
disease candidates. There is a need for the identification of 
polynucleotides (e.g., cDNA species) which are aberrently 
expressed in a cell sample (e.g., white blood cells or 
platelets isolated from peripheral blood) from a patient 
having hyperhomocysteinemia. These polynucleotides and the 
encoded polypeptides can be used in methods to identify 
patients with hyperhomocysteinemia, to monitor the effect of 
pharmacologic interventions on homocysteine -induced pathology, 
for gene therapy vectors and as ptoential targets for drug 
development assays, among other uses. 

The references discussed herein are provided solely 
for their disclosure prior to the filing date of the present 
application. Nothing herein is to be construed as an 
admission that the inventors are not entitled to antedate such 
disclosure by virtue of prior invention. All publications 
cited are incorporated herein by reference, whether 
specifically noted as such or not. 

SUMMARY OF THE INVENTION 

The present invention provides compositions and 
methods for the diagnosis and treatment of atherosclerotic 
diseases and diseases of metabolism of sulfur-containing amino 
acids (e.g., homocysteinemia) which are associated with 
vascular damage and atherosclerotic disease. 

The invention provides genetic and immunological 
markers for the successful diagnosis and treatment of 
homocysteine- induced vascular disease, and methods employing 
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these markers to monitor the success of interventions which 
are intended to lower plasma homocysteine levels. The genetic 
and immunological markers are also used to identify therapies 
which will be beneficial with respect to recurrent vascular 
5 disease and thrombosis related to hyperhomocystenemia and 

provide a basis for developing pharmaceutical agents which, 
when administered to a patient or cell culture, alter 
homocysteine- induced gene regulation and/or the level of 
homocysteine or other sulfur-containing amino acids in the 

10 extracellular environment. 

The methods and compositions of the invention will 
find use in the prophylaxis and treatment of various 
cardiovascular conditions, such as unstable angina, acute 
myocardial infarction (heart attack) , cerebrovascular 

15 accidents (stroke), hypertension, renal artery stenosis, 

aortic stenosis, deep vein occlusive disease, and the like. 
The present invention is not limited to such uses, however, 
and the compositions and methods described herein may find use 
in other in vitro and in vivo situations whenever it is 

20 desirable to inhibit atherosclerotic plaque formation and 
accretion or to promote normal homeostasis of sulfur- 
containing amino acid metabolism. 

A basis of the present invention is the use of 
subtract ive hybridization of a population of cDNA species to 

25 selectively enrich for cDNA species which are preferentially 
expressed in cells subjected to elevated levels of 
extracellular homocysteine or other sulfur- containing amino 
acids or metabolites thereof. Another basis of the invention 
is a method using a polymerase-mediated chain extension, such 

30 as for example PCR, and differential amplification and/or 
display and/or recovery of reaction products, such as on a 
sequencing gel or otherwise, to effect isolation of cDNA 
species representing sequences of polynucleotide species, such 
as mRNA or primary or intermediate RNA transcripts, which are 

3 5 substantially increased or substantially decreased in 

abundance in mammalian cells exposed to hyperphysiological 
levels of homocysteine, such as occurs in patients having 
hyperhomocysteinemia . A variation of the method is used to 
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identify mRNA species which exhibit an alter splicing pattern 
and/or mRNA length in the presence of hyperphysiological 
levels of homocysteine. The method is employed to isolate and 
identify cDNA species containing sequences of the mRNA species 
which typically encode mammalian proteins, such as, for 
example, a polypeptide which is induced or repressed by 
elevated levels of extracellular homocysteine. 

In a broad aspect, the invention provides a method 
for identifying individual humans at risk for developing a 
atherosclerotic vascular disease, such as coronary artery 
disease, cerebral infarction, myocardial infarction, angina, 
unstable angina, hypertension, lower extremity phlebitis and 
deep vein thrombosis, and poor peripheral circulation, and 
related diseases causally related to atherosclerosis. The 
method comprises identifying individuals having elevated 
levels of serum homocysteine by determining the relative 
expression level of at least one homocysteine -regulated mRNA 
in a cellular sample (e.g., white blood cells or platelets 
isolated from peripheral blood) obtained from the individual; 
if the determination demonstrates that the relative expression 
level of the homocysteine-regulated mRNA is in a predetermined 
pathognomonic range, the individual is identified as a 
atherosclerosis disease candidate and a hyperhomocystenemia 
candidate. Such disease candidates can be treated with 
therapies intended to reduce the level of serum homocysteine, 
which may include vitamin therapy and other modalities. The 
determination of the relative expression level of the 
homocysteine-regulated mRNA(s) is performed by a suitable 
diagnostic assay, which may include: (1) hybridization of an 
RNA sample of the sample to a polynucleotide probe of a 
predetermined sequence known to hybridize to said 
homocysteine-regulated mRNA, or (2) PCR, LCR, or other 
^polynucleotide amplification method employing a primer or 
primer set of a predetermined sequence known to be capable of 
priming amplification of said homocysteine-regulated mRNA(s) . 
Typically, the primer and probe sequences are obtained from a 
predetermined nucleotide sequence of an identified 
homocysteine-regulated mRNA or its cDNA. In an alternative 
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embodiment , the determination of the relative expression level 
of the homocysteine -regulated mRNA(s) is performed by 
determining the level of an encoded protein product of the 
homocysteine -regulated mRNA(s) , such as by immunoassay using 
an antibody predetermined to have a sufficient binding 
affinity and specificity to quantitatively assay the protein 
product . 

Differential Display of cDNA 
In an embodiment, the invention provides a method 
for identifying polynucleotides and polynucleotide sequences 
of mRNAs and the corresponding cDNAs which are substantially 
increased (i.e., at least 2-fold increased in relative 
abundance in the total mRNA pool) or substantially decreased 
(i.e., at least 50 percent decreased in relative abundance in 
the total mRNA pool) in cells exposed to hyperphysiological 
levels of homocysteine (e.g., 0.1 mM to about 5 mM for 
cultured endothelial or megakaryocytic cells) and/or cells 
explanted from an individual having transient or chronic 
hyperhomocysteinemia, as compared to cells exposed to 
normophysiological levels of homocysteine (e.g., less than 0.1 
mM homocysteine in culture medium or cells explanted from an 
individual having serum homocysteine levels within one 
standard deviation of the population mean. The method 
generally involves obtaining a first mRNA sample which is 
obtained from cells exposed to hyperphysiological levels and 
obtaining a second mRNA sample from cells which are exposed to 
normophysiological levels of homocysteine; the first and 
second mRNA samples are individually subjected to PCR 
amplification with a 5' arbitrary primer (s) and, optionally, a 
3' arbitrary primer (s) or an oligo(dT) 3' primer, under 
suitable reaction conditions whereby PCR amplification 
product (s) are labeled; the PCR amplification products are 
displayed by electrophoresis on a sequencing gel or other 
size -discriminatory separation method, and discrete size 
fractions (such as bands on a sequencing gel) of a desired 
type are identified by the presence and abundance of label in 
the fraction (s) ; fractions (e.g., bands on a gel) which are 
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significantly increased or decreased in the sample of reactant 
products amplified from the rnRNA sample obtained from cells 
exposed to a hyperphysiological homocysteine level relative to 
the sample obtained from cells exposed to normophysiological 
homocysteine levels are collected (such as by cutting out a 
portion of a sequencing gel corresponding to a band position) 
and the polynucleotide (s) are eluted and recovered. Often, 
multiple samples representing samples from cells exposed to 
varying levels of extracellular homocysteine (or exposure 
times) are amplified with the same arbitrary primers and the 
labeled reaction products run as one or more parallel lanes on 
said sequencing gel for comparison. Frequently, the recovered 
polynucleotides are amplified with PCR and/or cloned into a 
replicable vector and/or are sequenced. By this method, 
homocysteine -induced and homocysteine -repressed rnRNA species 
are identified. Typically, the rnRNA samples are obtained from 
cell types which are readily used for diagnostic assays and/or 
which are of the histological origin that is commonly involved 
in atherosclerotic disease (e.g., arterial intimal cells, 
smooth muscle cells, endothelial cells, fibroblasts, nucleated 
blood cells, and the like) . 

In an embodiment, rnRNA or cDNA produced therefrom 
obtained separately from (1) a cell population exposed to 
elevated homocysteine levels and (2) a control cell population 
are amplified with PCR or a similar amplification method using 
one or more 5' arbitrary primers, either alone or in 
combination with a 3' poly(dT) primer or a 3 ' poly(dT) primer 
comprising 3' arbitrary nucleotide (s) , typically two such 3' 
arbitrary nucleotide which are distinguishable from the 
poly(dT) tract as having the sequence (dT) 2 MN, wherein z is 
typically about 10-20 (usually 11 to 15) , M is dA, dC, or dG, 
and N is dA, dT, dC, or dG. Additional predetermined sequence 
may be present in the 5' portion of the 5' arbitrary primer 
and/or in the 5' portion of the 3' arbitrary primer (e.g., a 
restriction enzyme cutting site) . Often, a series of parallel 
amplifications are conducted, each employing an aliquot or an 
equivalent of the rnRNA or cDNA in conjunction with a subset 
(typically one) of a collection of 5' arbitrary primers in 



WO 97/25440 PCT/CA97/00001 

11 

combination with a 3' poly(dT) primer or a subset (typically 
one) of a 3' poly(dT) primer comprising 3' arbitrary 
nucleotide (s) . If desired, the sequences of the set of 5' 
arbitrary primers and the set of 3' arbitrary primers (if 
5 used) can be selected by the artisan such that while any given 
primer (or pair of primers) can prime the amplification of 
only a subpopulation of the mRNA or cDNA population, the 
entire set of primers is theoretically capable of collectively 
priming most or all of the species in the mRNA or cDNA 

10 population. Such a set of arbitrary primers capable of 

collectively priming amplification of at least a portion of 
substantially all species in the mRNA or cDNA population, 
while individually priming amplification of only a 
subpopulation, is referred to herein as a "coverage set" of 

15 arbitrary PCR primers. Often, arbitrary primers have typically 
one or two predetermined nucleotide at the 3' end, and which 
may include additional predetermined nucleotide at the 5' end 
(e.g., such as for a restriction site and/or to generate a 
clonable overlapping end) . 

20 

Subtractive Libraries 
In an embodiment of the invention, a subtraction 
cDNA library is used to identify mRNA sequences which are 
expressed in cells exposed to high levels of homocysteine and 

2 5 which are substantially not expressed or undetectable in 

control cells which are exposed to physiological levels of 
homocysteine or which are homocysteine -def icient . The 
subtracted cDNA library is produced by subtraction 
hybridization between: (1) a first cDNA population synthesized 

3 0 by reverse transcription of a mRNA sample from a first cell 

population and synthesis of second strand cDNA, and (2) a 
second cDNA population synthesized by reverse transcription 
and second strand synthesis of a mRNA sample from a second 
cell population, wherein said first cell population and said 
3 5 second cell population differ with regard to their exposure to 
homocysteine, typically by at least one order of magnitude or 
more, such that at least one RNA species is expressed in one 
cell population at a relative abundance which is at least two 
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times, typically 10 times or more as abundant in the cellular 
RNA pool as compared to the other cell population. The 
subtractive hybridization procedure selectively removes said 
second cDNA population and DNA hybrids formed between one 
strand of a member of said first cDNA population and a 
complementary sequence in a member of said second cDNA 
population, thereby leaving a pool of subtracted cDNA enriched 
for sequences which are substantially more abundant in the 
first cDNA population than in the second cDNA population. 
Optionally, one or more additional cycles of subtractive 
hybridization are performed on the pool of subtracted cDNA. 
Separation of hybridized members of the first cDNA population 
from single -stranded or unhybridized double -stranded members 
of the first cDNA population can be accomplished by a variety 
of methods. Typically, the second cDNA population (also 
referred to as "driver c DNA 41 ) is labeled with a recoverable 
label, such as biotinyl or digoxigenyl groups, so that the 
pool of enriched driver cDNA is substantially derived from the 
first cDNA populations. Other means for removing driver cDNA 
and hybrids therewith can be used. 

In an embodiment, the first cDNA population 
comprises an RNA species which is substantially absent or 
undetectable in said second cDNA population. For example but 
not limitation, the first cDNA population may comprise a cDNA 
containing a coding or noncoding sequence of an mRNA for a 
mammalian protein involved in metabolism of sulfur- containing 
amino acids or other metabolic or structural function, wherein 
the second cDNA population may substantially lack such mRNA 
specie (s) . Subtractive hybridization can be performed in a 
hybridization reaction between the first cDNA population and 
the second cDNA population, typically wherein the second cDNA 
population comprises a recoverable label such as a covalently 
attached biotinyl substitutent or the like. Selective removal 
of driver /tester cDNA hybrids and unhybridized driver cDNA 
from the hybridization reaction is accomplished, such as by 
contacting the hybridization reaction (cDNA molecules) with a 
ligand which binds to the recoverable label present on the 
driver cDNA (e.g., avidin, streptavidin, anti-biotin antibody, 
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etc.) and can be selectively removed from the hybridization 
reaction, thereby substantially enriching the remaining 
hybridization reaction for the subtracted cDNA derived from 
the first (tester) cDNA population. One or more additional 
5 cycles of subtractive hybridization can be performed on the 

subtracted cDNA; typically against the second cDNA population 
(recursive subtractive hybridization) and/or a third (or 
subsequent) cDNA population wherein said RNA species is 
substantially absent or undetectable; usually wherein the 

10 second, third or subsequent cDNA population comprises a 
recoverable label . 

In an embodiment, the first cDNA population is 
obtained by reverse transcription of RNA obtained from a first 
cell population comprising mammalian cells exposed to elevated 

15 homocysteine or other sulfur-containing amino acid(s), and a 

second cDNA population is obtained by reverse transcription of 
RNA obtained from a second cell population comprising 
mammalian cells which are exposed to physiological conditions 
(e.g., levels of homocysteine and other sulfur-containing 

20 amino acids which are within one standard deviation of the 

population mean for humans) . Typically, the first and second 
cell populations are of the same or equivalent histological 
type (e.g., fibroblast, epithelial, endothelial, smooth 
muscle, macrophage, megakaryocyte, lymphocyte, etc.) to reduce 

25 the number of RNA species which are differentially expressed. 
Where a third (or subsequent) cDNA population is employed, it 
is generally obtained by reverse transcription of RNA from a 
cell population of mammalian cells which are histologically 
similar or distinct from the first cell population and which 

30 are similar or identical conditions. 

Polynucleotides Induced or Repressed by Homocysteine 
The present invention relates to the unexpected 
discovery that exposure of human cells to hyperphysiological 
35 levels of homocysteine produces a substantial increase or 
reduction in the relative abundance of particular mRNA 
species, especially in histological types of cells which are 
present in the vasculature (e.g., endothelial cells) or 
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circulatory system (e.g., nucleated cells of a hemopoietic 
lineage) . 

In an embodiment, the invention provides 
polynucleotides obtained by the disclosed method, wherein the 
polynucleotides comprise partial or full-length cDNA sequences 
of mRNA species which are induced (i.e., the relative 
abundance of the species in the mRNA pool is increased) at 
least 2- fold, typically by at least an order of magnitude, by 
the presence of hyperphysiological homocysteine (e.g., about 5 
mM homocysteine in cell culture) or which are repressed (i.e., 
the relative abundance of the species in the mRNA pool is 
decreased) by more than about 50 percent, typically by at 
least an order of magnitude (90 percent) , or more. The 
invention also provides the use of these polynucleotides for 
diagnosis of homocysteinemia and/or atheroscelotic disease, 
therapeutic monitoring, pharmaceutical development and 
testing, and treatment of homocysteinemia and vascular disease 
or a predisposition thereto. 

The invention also provides expressed sequence tags 
(ESTs) and polynucleotide sequences of mRNA species which are 
substantially increased or substantially decreased in relative 
abundance in cells exposed to hyperphysiological levels of 
homocysteine. The ESTs are typically at least about 2 5 
nucleotide long, often at least 35 nucleotide long, and 
frequently are at least 100 nucleotide long or longer up to a 
full-length mRNA which may comprise 300 to several thousand 
nucleotide . 

The invention provides polynucleotides comprising a 
sequence of at least 25 consecutive nucleotide which is 
identical to or complementary to portion of an mRNA species 
which is induced or repressed by homocysteine or other sulfur- 
containing amino acids; the mRNA species may encode a 
functional protein, although the polynucleotide sequence of 
the invention may comprise or consist of a non- coding 
sequence . 

In an embodiment, the invention provides a 
polynucleotide comprising a substantially full-length cDNA 
sequence identical to or complementary to a mammalian mRNA 
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species which is substantially increased or substantially 
decreased in relative abundance in cells exposed to 
hyperphysiological levels of sulfur-containing amino acids, 
such as homocysteine, referred to hereafter as 11 homocysteine - 
5 regulated genes", of which dhc-1 (Fig. 5) is an example. In 
one aspect, the polynucleotide comprises a sequence which is 
substantially identical, identical, substantially 
complementary, or complementary to least 30 consecutive 
nucleotide of the sequence designated dhc-1 and shown in Fig. 

10 5. The invention also provides a full-length dhc-1 cDNA and 
gene sequence obtained from a mammal, preferably a human. 
Accordingly, an embodiment of the invention involves the 
formation of a purified and/or isolated mammalian cDNA 
and genomic gene and fragments thereof. 

15 Polynucleotide sequences encoding dhc-1 polypeptides 

are provided. Polynucleotides comprising these sequences can 
serve as templates for the recombinant expression of 
quantities of dhc-1 polypeptides, such as full-length 
mammalian dhc-1 or immunogenic fragments thereof. Many 

20 polynucleotides comprising these sequences can also serve as 
probes for nucleic acid hybridization to detect the 
transcription rate and mRNA abundance of dhc-1 mRNA in 
individual cells ( e.g., lymphocytes or other somatic or germ 
cell types) by in situ hybridization and the like, and in 

25 specific cell populations by Northern blot analysis and/or by 
in situ hybridization (Alwine et al . (1977) Proc .' Natl. Acad. 
Sci . U.S.A. 74: 5350) and/or PCR amplification and/or LCR 
detection, or other suitable amplification method. 

Polynucleotides comprising sequences encoding dhc-1 

3 0 amino acid sequences can serve as templates for the 

recombinant expression of quantities of dhc-1 polypeptides, 
such as human dhc-1 and murine dhc-1. 

The invention also provides host cells expressing 
dhc-1 polypeptides encoded by a polynucleotide other than a 

35 naturally-occurring dhc-1 gene or homolog gene of the host 
cell (if present) . 

The invention also provides antisense 
polynucleotides complementary to polynucleotides encoding dhc- 
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1 polypeptide sequences, or non-coding portions thereof. Such 
antisense polynucleotides are employed to inhibit 
transcription and/or translation of the dhc-1 mRNA species and 
thereby effect a reduction in the amount of the respective 
dhc-1 polypeptide in a cell (e.g., an endothelial or nucleated 
blood cell of a patient) . Such antisense polynucleotides can 
function as dhc-1 -modulating agents by inhibiting dhc-1 
expression. The dhc-1 antisense polynucleotides are 
substantially identical to at least 25 contiguous nucleotide 
of the complementary sequence of the dhc-1 cDNA sequence shown 
in Fig. and denoted SEQ ID NO: 1. The dhc-2 antisense 
polynucleotides are typically ssDNA, ssRNA, methylphosphonate 
backbone nucleic acids, phosphorothiolate backbone, polyamide 
nucleic acids, and the like antisense structures known in the 
art. In one aspect of the invention, an antisense 
polynucleotide is administered to inhibit transcription and/or 
translation of dhc-1 in a cell. 

In a variation of the invention, polynucleotides of 
the invention are employed for diagnosis of pathological 
conditions or genetic disease that involve homocysteinemia, 
atherosclerosis, or other medical conditions related to the 
condition of hyperhomocysteinemia or to a homocysteine- 
regulated gene, such as dhc-1, and more specifically 
conditions and diseases that involve alterations in the 
structure or abundance of a homocysteine -regulated gene 
product (e.g., dhc-1) polypeptide, RNA transcript or splicing 
intermediate, mRNA, or genomic gene locus. 

The invention also provides antibodies which bind to 
dhc-1 with an affinity of about at least 1 x 10 7 M" 1 and which 
lack specific high affinity binding for most other 
polypeptides. Such antibodies can be used as diagnostic 
reagents to identify cells exhibiting altered dhc-1 function 
(e.g., increased dhc-1 expression) in a cellular sample from a 
patient (e.g., a lymphocyte or megakaryocte- containing blood 
sample, a solid tissue biopsy, and endothelial cell explant) , 
as commercial reagents to identify, isolate, and/or quantitate 
dhc-1 polypeptides in samples and histological specimens, and 
the like. Samples having cells with a substantially increased 
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relative amount of dhc-1 protein as compared to a control (or 
standardized) cell population of the same cell type(s) / are 
pathognomonic for hyperhomocystenemia and a predisposition to 
atherosclerosis. Frequently, anti-dhc-1 antibodies are 
5 included as diagnostic reagents for immunohistopathology 

staining of cellular samples in situ , such as histological 
sections comprising vasculature (e.g., endothelial cells and 
intima in small arterioles or capillaries. 

The invention also involves the use of the protein 

10 dhc-1 or mutein or fragment thereof for performing 

immunochemical methods for the detection and determination of 
the protein, in order to monitor therapy to reduce 
homocysteine levels or to detect or monitor the course of 
disease (e.g., hyperhomocysteinemia and/or atherosclerosis or 

15 other vascular disease) . 

The invention also provides dhc-1 polynucleotide 
probes for diagnosis of disease states (e.g., 
hyperhomocysteinemia or atherosclerosis predisposition) by 
detection of substantially elevated dhc-1 mRNA levels in cells 

20 explanted from a patient, or detection of a pathognomonic dhc- 
1 allele (e.g., by RFLP or allele-specif ic PGR analysis) 
linked to a genetic disease or predisposition. Typically, the 
detection will be by in situ hybridization using a labeled 
(e.g., 32 P, 35 S, 14 C, 3 H, fluorescent, biotinylated, 

25 digoxigeninylated) dhc-1 polynucleotide, although Northern or 
Southern blotting, dot blotting, or solution hybridization on 
bulk genomic DNA, RNA, or poly A + RNA isolated from a cell 
sample may be used, as may .PGR amplification using dhc-1- 
specific primers. Test samples containing cells which contain 

30 . a substantially increased amount of dhc-1 mRNA as compared to 
cells of the same cell type(s) from a comparable sample from a 
normal individual (or standardized normal value) identify the 
individual from which the test sample was obtained as a 
disease candidate for homocysteinemia and atherosclerosis 

35 predisposition. Similarly, the detection of pathognomonic 

rearrangements, deletion, or amplification of the dhc-1 gene 
locus or closely linked loci in a cell sample will identify 
the presence of a pathological condition or a predisposition 
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to developing a pathological condition (e.g., cancer, genetic 
disease) linked to the dhc-1 locus. The polynucleotide probes 
are also used for forensic identification of individuals, such 
as for paternity testing or identification of criminal 
suspects or unknown decedents . 

The present invention also provides a method for 
diagnosing a homocysteine -related disease (e.g., 
homocysteinemia, atherosclerosis) in a human patient, wherein 
a diagnostic assay (e.g., immunohistochemical staining of 
fixed cells by an antibody that specifically binds dhc-I 
polypeptides) is used to determine if a predetermined 
pathognomonic concentration of dhc-1 polypeptide or its 
encoding mKNA is present in a biological sample from a human 
patient; if the assay indicates the presence of dhc-1 
polypeptide or its encoding mRNA outside of the normal range 
(e.g., outside the predetermined pathognomonic concentration 
range) , the patient is diagnosed as having a disease condition 
or predisposition to developing premature atherosclerosis. 

The invention provides a composition for diagnosing 
a homocys t e ine - re lat ed pathological condition, comprising a 
polynucleotide probe comprising at least 25 nucleotide which 
are substantially identical to a dhc-1 gene sequence, 
typically at least 35 nucleotide which are substantially 
identical to dhc-1. In an aspect, the polynucleotide probe is 
labeled. In an aspect, the polynucleotide probe is 
immobilized, such as on a solid substrate or DNA probe array. 
In an aspect, the dhc-1 polynucleotide is used as a primer for 
PCR amplification, often in conjunction with a second 
( reverge ) primer which may be a dhc-1 polynucleotide, an 
oligo(dT) primer, a randomer or mixture of randomers or nested 
set of promiscuous primers, or the like; however, some 
variations may employ spontaneous self -priming via hairpin 
loop formation in the absence of a second primer. 

In a broad aspect, the invention provides a method 
for identifying individuals having an increased risk for 
developing premature atherosclerosis, the method comprising 
determining the relative abundance of a homocysteine -regulated 
mRNA in a cellular sample obtained from an individual as 
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compared to a predetermined standard value and/or as compared 
to a cellular sample obtained from one or more normal 
individuals predetermined to have serum homocysteine levels 
within one standard deviation of the population mean value. 
5 Cellular samples having a relative abundance of said 

homocysteine-regulated gene which is in a predetermined 
pathognomonic range are thereby identified as 
hyperhomocystenemia candidates and/or atherosclerosis 
candidates . 

10 In an embodiment , the present invention provides 

methods for detecting and quantifying a concentration of a 
homocysteine-regulated mRNA or protein species in biological 
samples, such as a cellular sample of a biological fluid 
(e.g., nucleated blood cells) or a tissue biopsy sample (e.g., 

15 buccal epithelium, vascular intimal cells) . 

It is a further object of the invention to provide 
methods for developing diagnostic methods and diagnostic kits 
for determining the concentration of a homocysteine-regulated 
mRNA or protein in cells of a biological sample . These 

20 methods and kits will typically comprise a polynucleotide 
comprising a sequence of at least 25 nucleotide which are 
substantially identical or complementary to a homocysteine- 
regulated mRNA, and/or an antibody which is specifically 
reactive with a protein encoded by a homocysteine-regulated 

25 gene. 

The present invention also provides a method for 
diagnosing hyperhomocysteinemia and/or a predisposition to 
early development of atherosclerosis in a human patient, 
wherein a diagnostic assay is used to determine if a 

3 0 predetermined pathognomonic concentration of a homocysteine- 
regulated mRNA is present in a cells of a biological sample 
from a human patient; if the assay indicates the presence of a 
homocysteine-regulated mRNA sufficiently above or below a 
predetermined pathognomonic concentration range, the patient 

3 5 is diagnosed as an disease candidate. Modifications of the 
method are provided which adapt the diagnostic assay for use 
in the diagnosis of human pathological conditions other than 
hyperhomocysteinemia. A diagnosis of hyperhomocysteinemia is 
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a presumption of an increased risk for premature development 
of vascular disease, such as atherosclerosis. 

In one aspect, the invention provides a 
polynucleotide (e.g., as a primer or probe) which specifically 
hybridizes to a predetermined homocysteine -regulated mRNA, 
wherein the polynucleotide is affixed to a solid substrate, 
typically wherein the solid substrate has a plurality of 
polynucleotide species affixed thereto, in a spatially defined 
array whereby each cell typically contains a single 
polynucleotide species, with the array often comprising in 
excess of 1000 distinct polynucleotide species. The probe 
polynucleotide is typically affixed by covalent linkage to the 
solid substrate. The solid substrate constitutes an array of 
polynucleotide probes and/or primers, wherein at least one 
member of the array is a probe polynucleotide for a 
predetermined homocysteine- regulated mRNA. Generally, the 
solid substrate will be less than 10 cm 3 and comprise at least 
1024 posit ionally distinct polynucleotide species, at least 
one of which is a probe polynucleotide which binds to a 
predetermined homocysteine-regulated mRNA. Such 
polynucleotides arrays on solid substrates (e.g., a 
polysilicon wafer) can be used for genotype determination, 
disease detection and diagnosis, therapeutic efficacy 
monitoring, forensic identification, or for sequencing (e.g., 
of a pool containing unknown polynucleotides; for sequencing a 
mammalian genome or cDNA library) , or other like uses. 

The invention provides a diagnostic kit for 
detecting a pathological condition, such as atherosclerosis 
and hyperhomocysteinemia, wherein the kit contains at least 
one polynucleotide predetermined to hybridize to a 
homocysteine-regulated mRNA or to prime amplification of a 
homocysteine-regulated mRNA . 

The invention provides a gene therapy method and 
compositions therefor, comprising a polynucleotide comprising 
a sequence of a predetermined homocysteine-regulated mRNA, 
which can often be operably linked to polynucleotide sequences 
to drive expression (e.g., promoter, enhancer, etc.) or other 
components of a gene therapy vector or homologous 
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recombination construct, according to method and materials 
known in the art. One variation comprises a polynucleotide in 
a viral vector for gene therapy. A variation employs a 
polynucleotide in a gene therapy delivery formulation (e.g., 
5 comprising cationic or neutral lipids, polylysine, 

polyarginine , or other delivery- enhancing vehicle) ; the 
polynucleotide comprising the homocysteine-regulated gene 
sequence may be formulated with other polynucleotides for 
therapeutic benefit. 

10 The invention also provides the use of a 

polynucleotide comprising a sequence of a homocysteine- 
regulated mRNA or gene to diagnose and/or treat disease, or to 
identify an individual based, in part, on their genotype as 
defined by a allele-specif ic restriction site pattern or 

15 nucleotide sequence, or abundance of mRNA transcripts or RNA 
splicing pattern variation . 

The invention also provide methods for isolating 
genomic gene clones of homocysteine-regulated mammalian genes. 
Such clones can include transcriptional regulatory sequences, 

20 such as promoters, enhancers, silencers, and the like; and 
often can include a sequence or portion which confers 
transcriptional responsiveness to the level of extracellular 
homocysteine (i.e., a " homocysteine response element " ) , 
preferably in a dose -dependent manner. 

25 The invention provides screening assays for 

identifying agents which modulate expression of genes which 
are regulated by the level of extracellular homocysteine . The 
screening assays can employ a variety of reporter strategies . 
In one embodiment, the relative abundance of at least one 

30 homocysteine-regulated mRNA is determined in a cell culture or 
cellular sample from a whole animal in the presence of and, 
separately, in the absence of an added agent; agents which 
produce a substantial change in the relative abundance of said 
homocysteine-regulated mRNA are thereby identified as 

3 5 candidate pharmaceutical for modifying a pathological 
condition or disease predisposition, such as 

hyperhomocysteinemia and/or atherosclerosis. Typically, the 
assay is performed using cells which are cultured in a culture 
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medium having at least 0.1 tnM homocysteine, often having at 
least 1 mM to about 5 mM homocysteine. 

In one embodiment, candidate therapeutic agents are 
identified by their ability to inhibit homocysteine -induced 
transcription of a homocysteine-regulated gene with minimal or 
undetectable inhibition of other genes, such as GAPDH, which 
are not substantially regulated by extracellular homocysteine 
levels. Preferably, the inhibition of an active agent is 
dose -dependent, and dose -dependent transcriptional modulating 
agents are preferably selected. 

The invention provides for the use of homocysteine- 
regulated polypeptides, homocysteine-regulated 
polynucleotides, and sequences thereof, for diagnosis and 
treatment of disease, for screening compound banks to identify 
candidate pharmaceutical agents, and for development of 
transgenic or knockout mice having an altered genotype and 
optionally an altered phenotype. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Identification of a homocysteine -inducible 
gene by mRNA differential display. Total RNA from HUVECs 
treated for 18 hr with 0, 0.1, 1.0 or 5.0 mM homocysteine were 
DNase -treated and subjected to mRNA differential display. The 
radiolabeled cDNA fragments were then resolved on 6% 
sequencing gels. A candidate cDNA fragment which demonstrates 
dose -dependent upregulation in the presence of homocysteine is 
marked by an arrow. HC, anchor primer 5' -AAGCT (11 ,C-3 ' . HG, 
anchor primer 5 ' -AAGCT (11) G-3 ' . . HAP1-3, arbitrary primers 5'- 
AAGCTTGATTGCC - 3 ' , 5' -AAGCTTCGACTGT-3 ' and 5 ' - AAGCTTTGGTCAG - 3 ' , 
r espe ct i ve ly . 

Figure 2. Identification of a homocysteine -inducible 
gene showing reproducibility by mRNA differential display. 
Total RNA from Dami cells, treated for 18 hr with 0 (lane 1) 
or 1.0 mM (lane 2) homocysteine, was DNase-treated and 
subjected to mRNA differential display using primers 5'- 
T,n)MG-3' and 5 ' -AGCCAGCGAA-3 ' . The radiolabeled cDNA 
fragments were then resolved on 6% sequencing gels. The 
arrowhead marks the homocysteine -inducible gene (designated 
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dhc-1) which is reproducibly amplified in two separate 
experiments (compare panels A and B) . 

Figure 3. Northern blot analysis using the cDNA 
fragment dhc-1 as a probe. Total RNA from Dami cells treated 
5 for 18 hr with 0 (lane 1) or 1.0 mM (lane 2) homocysteine was 
separated on a formaldehyde/1.2% agarose gel. cDNA fragment 
dhc-1 was labeled by random priming and hybridized as 
described infra. dhc-1 hybridizes specifically to a 1 . 8 kb 
transcript showing homocysteine -dependent upregulation (panel 

10 A) . As ah RNA loading control the same blot was reprobed with 
a GAPDH cDNA (panel B) . 

Figure 4. Northern blot analysis on HUVECs treated 
with homocysteine and probed with dhc-1. Total RNA from 
HUVECS , treated for 18 hr with 0, 0.1, 1.0 and 5 . 0 mM 

15 homocysteine (lanes 1 to 4, respectively) or 2 hr with 1.0 mM 
homocysteine (lane 5) , was separated on a f ormaldeyde/l . 2% 
agarose gel. cDNA fragment dhc-1 was labeled by random 
priming and hybridized as described infra, dhc-1 hybridizes 
specifically to a 1.8 kb transcript showing b dose -dependent 

20 upregulation in response to homocysteine. The 1.8 kb 

transcript was also induced in HUVECs treated for 2 hr with 1 
mM homocysteine (lane 5) . As a RNA loading control, the same 
blot was reprobed with a GAPDH cDNA. 

Figure 5. Partial nucleotide sequence of the dhc-1 

25 cDNA fragment (designated T7MGAP1B . SEQ) shown in panel A. 

Using the GenBank DNA database, dhc-1 was shown to be >96% 
identical to the unidentified human EST designated HHCJ64 
(panel B) . 

3 0 DETAILED DESCRIPTION 

Unless defined otherwise, all technical and 
scientific terms used herein have the same meaning as commonly 
understood by one of ordinary skill in the art to which this 
invention belongs. Although any methods and materials similar 

35 or equivalent to those described herein can be used in the 
practice or testing of the present invention, the preferred 
methods and materials are described. For purposes of the 
present invention, the following terms are defined below. 
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Definitions 

As used herein, the twenty conventional amino acids 
and their abbreviations follow conventional usage (Immunology 
- a Synthesis , 2nd Edition, E.S. Golub and D.R. Gren, Eds . , 
Sinauer Associates, Sunderland, Massachusetts (1991) , which is 
incorporated herein by reference) . 

Unless specified otherwise, the conventional 
notation used herein portrays polynucleotides as follows: the 
lefthand end of single -stranded polynucleotide sequences is 
the 5' end; the lefthand direction of double -stranded 
polynucleotide sequences is referred to as the 5' direction. 
The direction of 5' to 3' addition of nascent RNA transcripts 
is referred to as the transcription direction; sequence 
regions on the DNA strand having the same sequence as the RNA 
and which are 5' to the 5' end of the RNA transcript are 
referred to as "upstream sequences"; sequence regions on the 
DNA strand having the same sequence as the RNA and which are 
3' to the 3' end of the coding RNA transcript are referred to 
as "downstream sequences" . 

As used herein, the term "polynucleotide" refers to 
a polymer composed of a multiplicity of nucleotide units 
(ribonucleotide or deoxyribonucleotide or related structural 
variants) linked via phosphodiester bonds. A polynucleotide 
can be of substantially any length, typically from about 10 
nucleotide to about 1x1 0 9 nucleotide or larger. As used 
herein, an "oligonucleotide" is defined as a polynucleotide of 
from 6 to 100 nucleotide in length. Thus, an oligonucleotide 
is a subset of polynucleotides. 

The term "naturally-occurring" as used herein as 
applied to an object refers to the fact that an object can be 
found in nature. For example, a polypeptide or polynucleotide 
sequence that is present in an organism (including viruses) 
that can be isolated from a source in nature and which has not 
been intentionally modified by man in the laboratory is 
naturally-occurring. Generally, the term naturally-occurring 
refers to an object as present in a non -pathological 
(undiseased) individual, such as would be typical for the 
species . 
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The term "corresponds to" is used herein to mean 
that a polynucleotide sequence is homologous (i.e., is 
identical, not strictly evolutionarily related) to all or a 
portion of a reference polynucleotide sequence. In 
contradistinction, the term "complementary to" is used herein 
to mean that the complementary sequence is homologous to all 
or a portion of a reference polynucleotide sequence. For 
illustration, the nucleotide sequence "TATAC" corresponds to a 
reference sequence "TATAC" and is complementary to a reference 
sequence "GTATA" . 

The following terms are used to describe the 
sequence relationships between two or more polynucleotides: 
"reference sequence", "comparison window", "sequence 
identity", "percentage of sequence identity", and "substantial 
identity" . A "reference sequence" is a defined sequence used 
as a basis for a sequence comparison; a reference sequence may 
be a subset of a larger sequence, for example, as a segment of 
a full-length cDNA or gene sequence given in a sequence 
listing, or may comprise a complete cDNA or gene sequence. 
Generally, a reference sequence is at least 12 nucleotide in 
length, frequently at least 15 to 18 nucleotide in length, and 
often at least 25 nucleotide in length. Since two 
polynucleotides may each (1) comprise a sequence (i.e., a 
portion of the complete polynucleotide sequence) that is 
similar between the two polynucleotides, and (2) may further 
comprise a sequence that is divergent between the two 
polynucleotides, sequence comparisons between two (or more) 
polynucleotides are typically performed by comparing sequences 
of the two polynucleotides over a "comparison window" to 
identify and compare local regions of sequence similarity. 

A "comparison window", as used herein, refers to a 
conceptual segment of at least 12 contiguous nucleotide 
positions wherein a polynucleotide sequence may be compared to 
a reference sequence of at least 12 contiguous nucleotide and 
wherein the portion of the polynucleotide sequence in the 
comparison window may comprise additions or deletions (i.e., 
gaps) of 2 0 percent or less as compared to the reference 
sequence (which does not comprise additions or deletions) for 
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optimal alignment of the two sequences. Optimal alignment of 
sequences for aligning a comparison window may be conducted by 
the local homology algorithm of Smith and Waterman (1981) Adv. 
Appl. Math. 2: 482, by the homology alignment algorithm of 
Needleman and Wunsch (1970) J. Mol . Biol. 48: 443, by the 
search for similarity method of Pearson and Lipman (1988) 
Proc. Natl. Acad. Sci. (U.S.A.) 85: 2444, by computerized 
implementations of these algorithms (GAP, BESTFIT, FASTA, and 
TFASTA in the Wisconsin Genetics Software Package Release 7.0, 
Genetics Computer Group, 575 Science Dr., Madison, WI) , or by 
inspection, and the best alignment (i.e., resulting in the 
highest percentage of homology over the comparison window) 
generated by the various methods is selected. 

The term "sequence identity" means that two 
polynucleotide sequences are identical (i.e., on a nucleotide- 
by-nucleotide basis) over the window of comparison. The term 
"percentage of sequence identity" is calculated by comparing 
two optimally aligned sequences over the window of comparison, 
determining the number of positions at which the identical 
nucleic acid base (e.g., A, T, C, G, U, or I) occurs in both 
sequences to yield the number of matched positions, dividing 
the number of matched positions by the total number of 
positions in the window of comparison (i.e., the window size), 
and multiplying the result by 100 to yield the percentage of 
sequence identity. The terms "substantial identity" as used 
herein denotes a characteristic of a polynucleotide sequence, 
wherein the polynucleotide comprises a sequence that has at 
least 80 percent sequence identity ,. preferably at least 85 
percent identity and often 90 to 95 percent sequence identity, 
more usually at least 99 percent sequence identity as compared 
to a reference sequence over a comparison window of at least 
20 nucleotide positions, frequently over a window of at least 
25-50 nucleotide, wherein the percentage of sequence identity 
is calculated by comparing the reference sequence to the 
polynucleotide sequence which may include deletions or 
additions which total 20 percent or less of the reference 
sequence over the window of comparison. The reference 
sequence may be a subset of a larger sequence. 
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The term "primer" as used herein refers to an 
oligonucleotide whether occurring naturally as in a purified 
restriction digest or produced synthetically, which is capable 
of acting as a point of initiation of synthesis when placed 
5 under conditions in which synthesis of a primer extension 
product which is complementary to a nucleic acid strand is 
induced, i.e., in the presence of nucleotide and an agent for 
polymerization such as DNA polymerase and at a suitable 
temperature and pH. The primer is preferably single- stranded 

10 for maximum efficiency in amplification, but may alternatively 
be double stranded- If double stranded, the primer is first 
treated to separate its strands before being used to prepare 
extension products. Preferably, the primer is an 
oligodeoxyribonucleotide . The primer must be sufficiently long 

15 to prime the synthesis of extension products in the presence 
of the agent for polymerization. The exact lengths of the 
primers will depend on many factors, including temperature and 
source of primers. For example, depending on the complexity 
of the target sequence, the oligonucleotide primer typically 

20 contains 15-25 or more nucleotides, although it may contain 
fewer nucleotide. Short primer molecules generally require 
lower temperatures to form sufficiently stable hybrid 
complexes with its template. In some embodiments, the primers 
can be large polynucleotides, such as from about 200 

25 nucleotide to several kilobases or more. The primers herein 
are selected to be substantially complementary to the 
different strands of each specific sequence to be amplified. 
The primers must be sufficiently complementary to hybridize 
with their respective strands. Therefore, the primer sequence 

30 need not reflect the exact sequence of the template. For 
example, a non- complementary nucleotide fragment may be 
attached to the 5' end of the primer, with the remainder of 
the primer sequence being complementary to the strand. 
Alternatively, noncomplementary bases or longer sequences can 

3 5 be interspersed into the primer, provided that the primer 

sequence has sufficient complementarity with the sequence of 
the strand to be amplified to hybridize therewith and thereby 
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form a template for synthesis of the extension product of the 
other primer. 

As used herein, "suitable reaction conditions" are 
those conditions suitable for conducting PCR amplification 
using conventional reagents. Such conditions are known or 
readily established by those of skill in the art, and can be 
exemplified by the reaction conditions used in U.S. Patents 
4,683,202, 4,683,195, and 4,800,159, which are incorporated 
herein by reference. As one example and not to limit the 
invention, suitable reaction conditions can comprise: 0.2mM 
each dNTP, 2.2 mM MgCl 2 , 50 mM KC1 , 10 mM Tris-HCl pH 9.0, 
0.1% Triton X-100. 

As used herein the term "physiological conditions" 
re fers to temperature, pH, ionic strength, viscosity, and like 
biochemical parameters which are compatible with a viable 
organism, and/or which typically exist intracellularly in a 
viable cultured yeast cell or mammalian cell. For example, 
the intracellular conditions in a yeast cell grown under 
typical laboratory culture conditions are physiological 
conditions. Suitable in vitro reaction conditions for PCR and 
many polynucleotide enzymatic reactions and manipulations are 
generally physiological conditions. In general, in vitro 
physiological conditions comprise 50-200 mM NaCl or KC1, pH 
6.5-8.5, 20-45°C and 0.001-10 mM divalent cation (e.g., Mg ++ , 
Ca 4f ) ; preferably about 150 mM NaCl or KC1, pH 7.2-7.6, 5mM 
divalent cation, and often include 0.01-1.0 percent 
nonspecific protein (e.g., BSA) . A non-ionic detergent 
(Tween, NP-40, Triton X-10 0) can often be present, usually at 
about 0.001 to 2%, typically 0.05-0.2% (v/v) . Particular 
aqueous conditions may be selected by the practitioner 
according to conventional methods. For general guidance, the 
following buffered aqueous conditions may be applicable: 10- 
250 mM NaCl, 5-50 mM Tris HC1, pH 5-8, with optional addition 
of divalent cation (s) and/or metal chelators and/or nonionic 
detergents and/or membrane fractions and/or antifoam agents 
and/or scintillants . 

As used herein, the terms "label" or "labeled" 
refers to incorporation of a detectable marker, e.g. , by 
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incorporation of a radiolabeled nucleotide or incorporation of 
nucleotide having biotinyl moieties that can be detected by- 
marked avidin (e.g., streptavidin containing a fluorescent 
marker or enzymatic activity that can be detected by optical 
5 or calorimetric methods) . Various methods of labeling 
polynucleotides are known in the art and may be used. 
Examples of labels include, but are not limited to, the 
following: radioisotopes (e.g., 3 H, 14 C, 35 S, 125 I, 131 I) , 
fluorescent labels (e.g., FITC, rhodamine, lanthanide 
10 phosphors), enzymatic labels (e.g., horseradish peroxidase, (3- 
galactosidase, lucif erase, alkaline phosphatase) , biotinyl 
groups, and the like. In some embodiments, labels are 
attached by spacer arms of various lengths to reduce potential 
steric hindrance. 

15 As used herein, "substantially pure" means an object 

species is the predominant species present (i.e., on a molar 
basis it is more abundant than any other individual 
macromolecular species in the composition) , and preferably a 
substantially purified fraction is a composition wherein the 

20 object species comprises at least about 50 percent (on. a molar 
basis) of all macromolecular species present. Generally, a 
substantially pure composition will comprise more than about 
8 0 to 90 percent of all macromolecular species present in the 
composition. Most preferably, the object species is purified 

2 5 to essential homogeneity (contaminant species cannot be 

detected in the composition by conventional detection methods) 
wherein the composition consists essentially of a single 
macromolecular species. Solvent species, small molecules 
(<500 Daltons) , and elemental ion species are not considered 

3 0 macromolecular species. 

The term "recombinant" used herein refers to 
macromolecules produced by recombinant DNA techniques wherein 
the gene coding for a polypeptide is cloned by known 
recombinant DNA technology. For example, an amplified or 
3 5 assembled product polynucleotide may be inserted into a 

suitable DNA vector, such as a bacterial plasmid, and the 
plasmid used to transform a suitable host. The gene is then 
expressed in the host to produce the recombinant protein. The 
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transformed host may be prokaryotic or eukaryotic, including 
mammalian, yeast , Aspergillus and insect cells. One preferred 
embodiment employs bacterial cells as the host . 
Alternatively, the product polynucleotide may serve a non- 
coding function (e.g., promoter, origin of replication, 
ribosome -binding site, etc.). 

As used herein "normal blood" or "normal human 
blood" refers to blood from a healthy human individual who 
does not have active coronary artery disease, atherosclerotic 
disease, or other vascular disorder, or an identified 
predisposition for developing a vascular disease; such 
individuals have serum homocysteine levels within or below the 
normal range for the age-adjusted population and exhibit 
typical metabolic response to methionine overloading; such 
individuals are typically normotensive and can be identified 
by various means, including coronary arterial scans, and the 
like. Similarly, "normal serum", "normal plasma", "normal 
urine", and "normal saliva" refer to the respective fluids 
obtained from a healthy human individual who does not have 
active atherosclerotic disease, homocysteinemia, or other 
identified condition associated with a vascular pathological 
disorder . 

As used herein "pathognomonic concentration" refers 
to a concentration of an analyte in a sample, for example the 
concentration of a protein in a blood, plasma, serum, urine, 
or saliva sample, or a concentration of a protein or mRNA in a 
cell (e.g., nucleated cell of a hematopoietic lineage) of a 
cellular sample obtained from an individual, wherein the 
concentration of the analyte is indicative of the presence of 
a specific disease or a predisposition to developing a 
specific disease, such as atherosclerotic disease and/or 
homocysteinemia. A pathognomonic concentration is a 
concentration of an analyte that falls outside the range of 
normal clinical values that is established by prospective 
and/or retrospective statistical clinical studies. Generally, 
an individual having an atherosclerotic disease (e.g., 
coronary or carotid artery lumen narrowing) and/or clinical 
hyperhomocysteinemia will exhibit a concentration of the 
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analyte in a tissue or biological fluid sample that is higher 
than the range of concentrations that characterize normal, 
undiseased individuals; typically the pathognomonic 
concentration is at least about one standard deviation above 
5 the mean normal value, more usually it is at least about two 
standard deviations or more above the mean normal value. 
However, essentially all clinical diagnostic tests produce 
some percentage of false positives and false negatives. The 
sensitivity and selectivity of the diagnostic assay must be 

10 sufficient to satisfy the diagnostic objective and any 

relevant regulatory requirements. In general, the diagnostic 
methods of the invention are used to identify individuals as 
disease candidates, providing an additional parameter in a 
differential diagnosis of disease made by a competent health 

15 professional. Most typically the analyte is a protein or mRNA 
expressed in a nucleated blood cell, endothelial cell, or 
vascular smooth muscle or fibroblastic cell, wherein the 
abundance of the protein or mRNA species is elevated or 
reduced by at least one standard deviation from the 

20 established mean population value by the presence of elevated 
levels of homocysteine characteristic of homocysteinemia . 

As used herein, the term "reporter polynucleotide" 
refers to a polynucleotide sequence comprising a 
transcriptional regulatory sequence operably linked to a 

25 structural sequence encoding a reporter protein (e.g., /3- 

galactosidase, lucif erase, chloramphenicol acetyl transferase, 
green fluorescent protein) . Preferred transcriptional 
regulatory sequences are obtained from the promoter region and 
5' flanking region of homocysteine -regulated genes. 

30 The term "transcriptional modulation" is used herein 

to refer to the capacity to either enhance transcription or 
inhibit transcription of a structural sequence linked in cis; 
such enhancement or inhibition may be contingent on the 
occurrence of a specific event, such as stimulation with an 

35 inducer (e.g., homocysteine) and/or may only be manifest in 
certain cell types (e.g., enothelial cells, nucleated blood 
cells) . The altered ability to modulate transcriptional 
enhancement or inhibition may affect the inducible 
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transcription of a gene, or may effect the basal level 
transcription of a gene, or both. For example, a reporter 
polynucleotide may comprise a homocysteine- inducible enhancer- 
promoter driving transcription of a sequence encoding a 
5 reporter protein. Such a reporter polypeptide may be 

transferred to a homocysteine -responsive cell line for use as 
a reporter host cell to screen a cDNA expression library for 
cloned sequences that affect homocysteine -induced 
transcriptional modulation. Cloned sequences that enhance 
10 transcription of the reporter in the absence of homocysteine 
may be identified as putative positive regulators of 
homocysteine -response elements, whereas cloned sequences that 
silence expression of the reporter in cells cultured in the 
presence of homocysteine may be identified (e.g., by selecting 
15 cells not expressing the reporter) as sequences encoding 

proteins that interfere with the expression of homocysteine- 
responsive genes. Numerous other specific examples of 
transcription regulatory elements, such as specific enhancers 
and silencers, are known to those of skill in the art and may 
20 be selected for use in the methods and polynucleotide 

constructs of the invention on the basis of the practitioner's 
desired application. Literature sources and published patent 
documents, as well as GenBank* and other sequence information 
data sources can be consulted by those of skill in the art in 
25 selecting suitable transcription regulatory elements for use 

in the invention. Where necessary, a transcription regulatory 
element may be constructed by synthesis (and ligation, if 
necessary) of oligonucleotides made on the basis of available 
sequence information. 
3 0 As used herein, the term "operably linked" refers to 

a linkage of polynucleotide elements in a functional 
relationship. A nucleic acid is "operably linked" when it is 
placed into a functional relationship with another nucleic 
acid sequence. For instance, a promoter or enhancer is 
3 5 operably linked to a coding sequence if it affects the 

transcription of the coding sequence. Operably linked means 
that the DNA sequences being linked are typically contiguous 
and, where necessary to join two protein coding regions, 
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contiguous and in reading frame. However, since enhancers 
generally function when separated from the promoter by several 
kilobases and intronic sequences may be of variable lengths, 
some polynucleotide elements may be operably linked but not 
5 contiguous. 

As used herein, a "homocysteine -regulated mRNA" or 
"homocysteine -responsive mRNA" is a mRNA species which is at 
least 2- fold increased in relative abundance in the total mRNA 
pool or at least 50 percent decreased in relative abundance in 
10 the total mRNA pool in cells exposed to hyperphysiological 

levels of homocysteine (e.g., 1 mM to about 5 mM for cultured 
endothelial or megakaryocytic cells) and/or cells explanted 
from an individual having transient or chronic 
hyperhomocysteinemia . 

15 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Generally, the nomenclature used hereafter and many 
of the laboratory procedures in cell culture, molecular 
genetics, and nucleic acid chemistry and hybridization 

20 described below are those well known and commonly employed in 
the art. Standard techniques are used for recombinant nucleic 
acid methods, polynucleotide synthesis, in vitro polypeptide 
synthesis, and the like and microbial culture and 
transformation (e.g., electroporation) . Generally enzymatic 

25 reactions and purification steps are performed according to 
the manufacturer's specifications. The techniques and 
procedures are generally performed according to conventional 
methods in the art and various general references ( see , 
generally , Sambrook et al . Molecular Cloning: A Laboratory 

3 0 Manual, 2d ed. (1989) Cold Spring Harbor Laboratory Press, 

Cold Spring Harbor, N.Y.; each of which is incorporated herein 
by reference) which are provided throughout this document. 
The procedures therein are believed to be well known in the 
art and are provided for the convenience of the reader. All 

35 the information contained therein is incorporated herein by 
reference . 

Oligonucleotides can be synthesized on an Applied 
Bio Systems oligonucleotide synthesizer according to 
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specifications provided by the manufacturer. The 
polynucleotide primers may be prepared using any suitable 
method, such as, for example, the phosphotriester and 
phosphodiester methods, or automated embodiments thereof . In 
5 one such automated embodiment diethylphosphoramidites are used 
as starting materials and may be synthesized as described by 
Beaucage et al . (1981) Tetrahedron Letters 22 ; 1859. One 
method for synthesizing oligonucleotides on a modified solid 
support is described in U.S. Pat. No. 4,458,066. It is also 
10 possible to use a primer which has been isolated from a 

biological source (such as a restriction endonuclease digest 
or the like. 

Methods for PCR amplification are described in the 
art ( PCR Technology: Principles and Applications for DNA 

15 Amplification ed. HA Erlich, Stockton Press, New York, NY 
(1989) ; PCR Protocols: A Guide to Methods and Applications , 
eds . Innis, Gelfland, Snisky, and White, Academic Press, San 
Diego, CA (1990); Mattila et al. (1991) Nucleic Acids Res. 19 : 
4 967; Eckert, K.A. and Kunkel, T.A. (1991) PCR Methods and 

20 Applications i: 17; and U.S. Patent Nos . 4,683,202 and 

4,965,188, each of which are incorporated herein by reference) 
and exemplified hereinbelow. 

It is evident that optimal PCR and hybridization 
conditions will vary depending upon the sequence composition 

25 and length(s) of the targeting polynucleotide (s) and 

target (s) , and the experimental method selected by the 
practitioner. Various guidelines may be used to select 
appropriate primer sequences and hybridization conditions 
(see, Maniatis et al . , Molecular Cloning: A Laboratory Manual 

30 (1989), 2nd Ed., Cold Spring Harbor, N.Y. ; Berger and Kimmel, 
Methods in Enzvmolocrv, Volume 152, Guide to Molecular Cloning 
Techniques (1987), Academic Press, Inc., San Diego, CA; PCR 
Protocols: A Guide to Methods and Applications , eds. Innis, 
Gelfland, Snisky, and White, Academic Press, San Diego, CA 

35 (1990) ; Benton WD and Davis RW (1977) Science 196 : 180; 

Goodspeed et al . (1989) Gene 76: 1; Dunn et al . (1989) J . 
Biol. Chem. 264 : 13057 which are incorporated herein by 
reference . 
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Overview 

Mounting evidence has shown that elevated blood 
levels of homocysteine represents an independent risk factor 
for vascular disease. Presently, however, there is no simple 
5 and/or reliable test to determine individuals with 

hyperhomocysteinemia. In addition, there is no strategy to 
measure the success of presently available treatments which 
lower blood homocysteine- levels . 

The invention discloses an approach to identify 

10 homocysteine -induced genes for the successful diagnosis and 

treatment of vascular disease. By identifying these genes and 
their gene products we will be able to better understand the 
mechanisms involved in homocysteine -induced vascular disease 
so that rational strategies for the detection and prevention 

15 of hyperhomocysteinemia can be pursued. 

The invention discloses a method which employs mRNA 
differential display (Liang and Pardee, 1992; Liang et al . 
1993) to identify specific homocysteine -induced genes. The 
identification of these genes provides a direct insight into 

20 the cytotoxic effect of homocysteine, thus leading to new 

strategies in the detection and prevention of homocysteine - 
induced vascular disease and thrombosis* 

A basis of the present invention is the use of 
differential display, comprising (1) obtaining two mRNA 

25 populations wherein a first mRNA population represents mRNA 
species expressed in cells exposed to elevated levels of 
homocysteine or obtained directly from hyperhomocysteinemia 
patients, and a second mRNA population represents mRNA species 
expressed in the same type of cells cultured with 

3 0 physiological levels of homocysteine or deficient in 

homocysteine or obtained directly from a normal patient having 
a homocysteine level at or below the population mean, and (2) 
PCR amplification, under suitable PCR conditions, of said mRNA 
populations with a 5' primer of arbitrary nucleotide sequence 

35 and optionally with a 3' primer comprising poly(dT) and/or 

poly(dT) and two or more arbitrary nucleotide at the 3' end to 
generate PCR products, and (3) differential display of said 
PCR products on an electrophoretic gel and recovery of 
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individual species of said PCR products by recovery from the 
gel, wherein the recovered species are identified as 
homocysteine -regulated mRNA sequences based on differential 
expression levels shown by distinct signal intensity on the 
gel . 

In a specific embodiment, the method is used to 
isolate and identify polynucleotides having sequences 
complementary to or corresponding to a coding or non-coding 
portion of a naturally occurring mRNA which is increased more 
than 2 -fold in relative abundance in the total mRNA pool in 
cells exposed to conditions of elevated homocysteine as 
compared to cells exposed to conditions of physiological 
concentrations or subphysiological concentrations of 
homocysteine . 

Subtractive hybridization can be useful for 
enriching a pool of up-regulated cDNA species from mRNA 
expressed in cells exposed to hyperphysiological homocysteine, 
and conventional screening of the resultant library (ies) will 
identify up-regulated mRNA species in the selected cDNA pool. 

Differential Display of Amplifi ed Products 
The general strategy is to amplify partial cDNA 
sequences from subsets of species in the mRNA pool obtained 
from cells exposed to elevated homocysteine levels and 
separately from the mRNA pool obtained from cells exposed to 
normal homocysteine levels, by PCR (RT-PCR) amplification (or 
other amplification means) using one or a series of arbitrary 
sequence primers. Arbitrary primers are selected according to 
various criteria at the discretion of the practitioner so that 
each will amplify only a fraction of the species in the mRNA 
pool so that the amplification products can be resolved and 
individually recovered on a separation system, such as a 
polyacrylamide gel (e.g., a DNA sequencing PAGE format). In 
part because the number and complexity of mRNA species 
represented in any particular mRNA pool may vary considerably 
depending upon the nature and complexity of the cell type(s) 
and culture conditions, the selection of arbitrary primers and 
their sequence (s) are determined by the practitioner with 
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reference to the literature ( see , Liang and Pardee (1992) 
Science 257 : 967; Liang et al . (1992) Cancer Res. 52: 6966; 
Liang et al . (1993) Nucleic Acids Res. 21: 3269; Utans et al . 

(1994) Proc. Natl. Acad. Sci - (U.S.A.) 91 : 64 63; Zimmermann et 
5 al. (1994) Proc. Natl. Acad. Sci. (U.S.A.) 91: 5456; Wang and 

Feuer stein (1994) Biotechniques : ; Wang and Brown 

(1991) Proc Natl Acad Sci USA 88 : 11505; Fischer et al, 

(1995) Proc. Natl. Acad. Sci. (U.S.A.) 92: 5331; Lohmann et 
al. (1995) Biotechniques 18: 200; Reeves et al. (1995) 

10 Biotechniques 18 : 18; Maser et al . (1995) Semin Nephrol 15: 

29; Chen et al . (1994) Biotechniques 16 : 1002; Callard et al . 
(1994) Biotechniques 16 : 1096; and Mou et al . (1994) Biochem 
Biophys Res Commun 199 : 564, incorporated herein by 
reference) . 

15 The mRNA pool is amplified with suitable arbitrary 

primer (s) (i.e., primer having a predetermined sequence which 
is selected without reference to a sequence of a desired 
differentially expressed mRNA; as described in Liang and 
Pardee (1992) op. cit ) for a suitable number of amplification 

20 cycles to generate sufficient amplification product for 
display and recovery of desired species; this can be 
determined empirically by the practitioner. The primer (s) may 
comprise 5' -terminal sequences which serve to anchor other PCR 
primers (distal primers) and/or which comprise a restriction 

25 site or half-site or other ligatable end. The amplified 

products are usually labeled and are typically are resolved by 
electrophoresis on a polyacrylamide gel; the location (s) where 
label is present are excised and the labeled product species 
is/are recovered from the gel portion, typically by elution. 

3 0 The resultant recovered product species (ESTs) can be 

subcloned into a replicable vector with or without attachment 
of linkers, amplified further, and/or sequenced directly. 
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Subtractive Hybridization 
An alternative approach is to use subtractive 
hybridization, wherein RNA prepared by conventional methods 
from a first cell population and RNA from a second cell 
population are separately reverse -transcribed and second- 
strand synthesized to form two pools of double -stranded cDNA, 
a tester pool comprising sequences of the mRNA species (s) 
desired to be enriched for, and a driver pool comprising the 
sequences desired to be subtracted from the tester pool. The 
two pools may be fragmented by endonuclease digestion 
(restriction endonuclease or non-specific endonuclease) - if 
desired to enhance hybridization efficiency. The driver pool 
is labeled, such as by photobiotinylation or attachment of 
another suitable recoverable label. The driver pool and 
tester pool are denatured and mixed together in a reaction 
mixture under hybridization conditions and incubated for a 
suitable hybridization period. The reaction mixture is 
contacted with a ligand which binds the recoverable label on 
the driver cDNA and which can be readily recovered from the 
reaction mixture (e.g., avidin attached to magnetic beads), 
such that a substantial fraction of the driver cDNA and any 
tester cDNA hybridized thereto is selectively removed from the 
reaction mixture. The remaining reaction mixture is enriched 
for tester cDNA species which are preferentially expressed in 
the first cell population as compared to the second cell 
population. The enriched (subtracted) tester cDNA pool may be 
subjected to one or more additional rounds of subtractive 
hybridization with a pool of labeled driver cDNA, which may be 
substantially identical to the initial pool of driver cDNA or 
which may represent a different cell population having mRNA 
species which are desired to be subtracted from the subtracted 
tester cDNA pool . A variety of means for accomplishing the 
subtractive hybridization (s) and suitable methodological 
guidance are available to the artisan ( see , Lee et al. (1991) 
Proc. Natl. Acad- Sci . (U.S.A.) 88: 2825; Milner et al . (1995) 
Nucleic Acids Res. 23 : 176; Luqmani et al . (1994) Anal. 
Biochem. 222: 102; Zebrowski et al . (1994) Anal. Bi ochem. 222: 
285; Robertson et al . (1994) Genomics 23: 42; Rosenberg et al . 



WO 97/25440 PCT/CA97/00001 

39 

(1994) Proc. Natl. Acad. Sci . -(U.S.A.) .91: 6113; Li et al . 

(1994) Biotechniaues 16 : 722; Hakvoort et al . (1994) Nucleic 
Acids Res. 22 : 878; Satoh et al . (1994) Mutat . Res. 316: 25; 
Austruy et al- (1993) op.cit : Marechal et al . (1993) Anal . 
5 Biochem. 208 : 330; El-Deiry et al . (1993) Cell 75: 817; Hara 
et al. (1991) Nucleic Acids Res. 19: 7097; and Herfort and 
Garber (1991) Biotechniaues 11: 598, each incorporated herein 
by reference) . cDNA species remaining in the tester pool can 
be cloned, amplified, and/or sequenced for unambiguous 
10 identification, such as in the form of an EST sequence. 

Full -Length cDNA 
Once the EST(s) are recovered, they may be used to 
obtain a substantially full-length cDNA from a cDNA library. 

15 The EST(s) can be sequenced and the sequence information used 
to generate a primer for primer extension (5 ' -RACE technique) 
or the EST can be labeled and used as a hybridization probe to 
identify larger cDNA clones from a cDNA library. 

Genomic or cDNA clones of the subtracted cDNA 

20 species may be isolated from clone libraries (e.g., available 
from Clontech, Palo Alto, CA) using the labeled EST (e.g., by 
nick- translation or end- labeling) or using hybridization 
probes designed on the basis of the nucleotide sequences 
identified and using conventional hybridization screening 

25 . methods (e.g., Benton WD and Davis RW (1977) Science 196 : 180; 

Goodspeed et al . (1989) Gene 76 : 1) . Where a full-length cDNA 
clone is desired, clone libraries containing cDNA derived from 
somatic cell mRNA. Alternatively, synthetic polynucleotide 
sequences corresponding to all or part of the EST sequences 

30 may be constructed by chemical synthesis of oligonucleotides. 
Additionally, polymerase chain reaction (PCR) using primers 
based on the sequence data disclosed in the EST sequences may 
be used to amplify DNA fragments from genomic DNA, mRNA pools, 
or from cDNA clone libraries. U.S. Patents 4,683,195 and 

35 4,683,202 describe the PCR method. Additionally, PCR methods 
employing one primer that is based on the EST sequence data 
and a second primer that is not based on that sequence data 
may be used. 
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Disclosure of the full coding sequences for the mRNA 
corresponding to the selected ESTs, such as full-length cDNAs 
for the mammalian homocysteine -regulated sequences, such as 
dhc-1 shown in Fig. 5, makes possible the construction of 
5 isolated polynucleotides that can direct the expression of 
homocysteine-regulated gene products (proteins) , fragments 
thereof, or analogs thereof. Further , the EST sequences of 
predetermined homocysteine-regulated mRNA species make 
possible the construction of nucleic acid hybridization probes 
10 and PGR primers that can be used to detect RNA and DNA 

sequences encoding the homocysteine-regulated gene products. 

Polynucleotides encoding full-length protein or 
fragments or analogs thereof, may include sequences that 
facilitate transcription (expression sequences) and 
15 translation of the coding sequences, such that the encoded 
polypeptide product is produced. Construction of such 
polynucleotides is well known in the art and is described 
further in Maniatis et al . , Molecular Cloning: A Laboratory 
Manual . 2nd Ed. (1989), Cold Spring Harbor, N. Y. For example, 
20 but not for limitation, such polynucleotides can include a 
promoter, a transcription termination site (polyadenylation 
site in eukaryotic expression hosts) , a ribosome binding site, 
and, optionally, an enhancer for use in eukaryotic expression 
hosts, and, optionally, sequences necessary for replication of 
25 a vector. A typical eukaryotic expression cassette will 

include a polynucleotide sequence encoding a homocysteine- 
regulated polypeptide linked downstream (i.e., in 
translational reading frame orientation; polynucleotide 
linkage) of a promoter such as the HSV tk promoter or the pgk 
30 (phosphoglycerate kinase) promoter, optionally linked to an 

enhancer and a downstream polyadenylation site (e.g., an SV40 
large T Ag poly A addition site) . 

Preferably, these amino acid sequences occur in the 
given order (in the amino- terminal to carboxy- terminal 
35 orientation) and may comprise other intervening and/or 

terminal sequences; generally such polypeptides are less than 
100 0 amino acids in length, more usually less than about 500 
amino acids in lengths, and frequently approximately 200 amino 
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acids in length. The degeneracy of the genetic code gives a 
finite set of polynucleotide sequences encoding these amino 
acid sequences; this set of degenerate sequences may be 
readily generated by hand or by computer using commercially 
5 available software (Wisconsin Genetics Software Package 

Release 7.0). Isolated polynucleotides typically are less 
than approximately 10,000 nucleotide in length. 

Additionally, where expression of a polypeptide is 
not desired, polynucleotides of this invention need not encode 

10 a functional protein. Polynucleotides, of this invention may 
serve as hybridization probes and/or PCR primers (amplimers) 
and/or LCR oligomers for detecting homocy s t e ine - r egul a t ed RNA 
or DNA sequences . 

Alternatively, polynucleotides of this invention may 

15 serve as hybridization probes or primers for detecting RNA or 
DNA sequences of related genes, such genes may encode 
structurally or evolutionarily related proteins. For such 
hybridization and PCR applications, the polynucleotides of the 
invention need not encode a functional polypeptide . Thus,*, 

20 polynucleotides of the invention may contain substantial 
deletions, additions, nucleotide substitutions and/or 
transpositions, so long as specific hybridization or specific 
amplification to a predetermined homocysteine -regulated mRNA 
sequence is retained. 

25 

General PCR Methods 
PCR synthesis can be performed using any suitable 
method. Generally it occurs in a buffered aqueous solution, 
preferably at a pH of 7-9, most preferably about 8. The 

30 bivalent primer (s) is/are added in suitable amounts (molar 

ratio to target) , typically less than conventional PCR methods 
because of the self -priming nature of the overlapped 
concatemers. The deoxyribonucleoside triphosphates dATP, 
dCTP, dGTP and TTP are also added to the synthesis mixture in 

3 5 adequate amounts and the resulting solution is heated to about 
85° »100° C. for from about 1 to 10 minutes, preferably from 1 
to 4 minutes. After this heating period the solution is 
allowed to cool to from 20° -40° C, which is preferable for the 
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primer hybridization. To the cooled mixture is added an agent 
for polymerization, and the reaction is allowed to occur under 
conditions known in the art. This synthesis reaction may occur 
at from room temperature up to a temperature above which the 
agent for polymerization no longer functions efficiently. 
Thus, for example, if DNA polymerase is used as the agent for 
polymerization, the temperature is generally no greater than 
about 45° C. The agent for polymerization may be any compound 
or system which will function to accomplish the synthesis of 
primer extension products, including enzymes. Suitable enzymes 
for this purpose include, for example, E. coli DNA polymerase 
I, Klenow fragment of E. coli DNA polymerase I, T4 DNA 
polymerase, other available DNA polymerases, reverse 
transcriptase, and other enzymes, including heat -stable 
enzymes, which will facilitate combination of the nucleotide 
in the proper manner to form the primer extension products 
which -are complementary to each nucleic acid strand. 
Generally, the synthesis will be initiated at the 3' end of 
the primer and proceed in the 5' direction along the template 
strand, until synthesis terminates, producing molecules of 
different lengths . 

The newly synthesized strand and its complementary 
nucleic acid strand form a double -stranded molecule which is 
used in the succeeding steps of the process. In the next step, 
the strands of the double -stranded molecule are separated 
using any of the procedures described above to provide 
single- stranded molecules. 

The steps of strand separation and extension product 
synthesis can be repeated as often as needed to produce the 
desired quantity of the specific nucleic acid sequence. The 
amount of the specific nucleic acid sequence produced will 
accumulate in an exponential fashion and the average size of 
the product will also increase as the length of the 
concatemers increases with each cycle. 



Tsolation of the Cognate Genes 
Genomic gene clones or full-length cDNA clones 
corresponding to a homocysteine -regulated mRNA EST sequence 
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are identified and isolated by screening a mammalian genomic 
or cDNA clone library, such as a human, rat, rabbit, or other 
genomic or cDNA library in yeast artificial chromosomes, 
cosmids, or bacteriophage X (e.g., X Charon 35), with a 
5 polynucleotide probe comprising a sequence of about at least 

25 contiguous nucleotide (or their complement) of the EST cDNA 
sequence, such as that shown in Fig. 5 for dhc-1. Typically, 
hybridization and washing conditions are performed at high 
stringency according to conventional hybridization procedures. 

10 Positive clones are isolated and sequenced. For illustration 
and not for limitation, a full-length polynucleotide 
corresponding to the sequence of Fig. 5 may be labeled and 
used as a hybridization probe to isolate genomic clones from a 
human or murine genomic clone library in XEMBL4 or XGEM11 

15 (Promega Corporation, Madison, Wisconsin) ; typical 

hybridization conditions for screening plaque lifts (Benton 
and Davis (1978) Science 196 : 180) can be: 50% formamide, 5 x 
SSC or SSPE, 1-5 x Denhardt's solution, 0.1-1% SDS, 100-200 /xg 
sheared heterologous DNA or tRNA, 0-10% dextran sulfate, 1 

20 xlO 5 to 1 x 10 7 cpm/ml of denatured probe with a specific 

activity of about 1 x 10 8 cpm//xg, and incubation at 42°C-37°C 
for about 6-3 6 hours. Prehybridization conditions are 
essentially identical except that probe is not included and 
incubation time is typically reduced. Washing conditions are 

25 typically 1-3 x SSC, 0.1-1% SDS, 50-70°C with change of wash 
solution at about 5-30 minutes. For isolating mouse 
homocysteine -regulated protein polynucleotides with a human 
polynucleotide probe, it is often preferred to hybridize at 
approximately 3 9°C and to wash sequentially at the following 

30 step temperatures: room temperature, 37°C, 39°C, 42°C, 45°C, 

50°C, 55°C, 60°C, 65°C, and 70°C, stopping after each step and 
monitoring the background probe signal (and optionally 
detecting signal by autoradiogram and/or phosphor imaging, if 
radiolabeled probe is used) and terminating the washing steps 

35 when suitable signal/noise ratio is achieved, as determined 
empirically. 

Other cDNAs and genomic clones (i.e., cognate 
nonhuman genes) can be analogously isolated from various 
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nonhuman cDNA and genomic clone libraries available in the art 
(e.g., Clontech, Palo Alto, CA) by using probes based on the 
sequences shown in Fig. 5 with hybridization and washing 
conditions typically being less stringent. 

Polynucleotides comprising sequences of 
approximately at least 30-50 nucleotides, preferably at least 
100 nucleotide, corresponding to or complementary to the 
nucleotide sequences shown in Fig. 5 can serve as PCR primers 
and/or hybridization probes for identifying and isolating 
germline genes corresponding to the EST sequence. These 
germline genes may be human or may be from a related mammalian 
species, preferably rodents or primates. Such germline genes 
may be isolated by various methods conventional in the art, 
including, but not limited to, by hybridization screening of 
genomic libraries in bacteriophage X or cosmid libraries, or 
by PCR amplification of genomic sequences using primers 
derived from the sequences shown in Fig. 5. Human genomic 
libraries are publicly available or may be constructed de novo 
from human DNA. 

It is apparent to one of skill in the art that 
nucleotide substitutions, deletions, and additions may be 
incorporated into the polynucleotides of the invention. 
Nucleotide sequence variation may result from sequence 
polymorphisms of various alleles, minor sequencing errors, and 
the like. However, such nucleotide substitutions, deletions, 
and additions should not substantially disrupt the ability of 
the polynucleotide to hybridize to one of the polynucleotide 
sequences shown in Fig. 5 under hybridization conditions that 
are sufficiently stringent to result in specific 
hybridization. 

Specific hybridization is defined herein as the 
formation of hybrids between a probe polynucleotide (e.g., a 
polynucleotide of the invention which may include 
substitutions, deletion, and/or additions) and a specific 
target polynucleotide (e.g., a polynucleotide having the 
sequence in Fig. 5) wherein the probe preferentially 
hybridizes to the specific target such that, for example, a 
single band corresponding to one or more of the RNA species 
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(or alternatively spliced mRNA species) can be identified on a 
Northern blot of RNA prepared from a suitable cell source 
(e.g., a somatic cell). Polynucleotides of the invention and 
recombinant ly produced homocysteine- regulated protein, and 
5 fragments or analogs thereof, may be prepared on the basis of 
the sequence data provided in Fig. 5 according to methods 
known in the art and described in Maniatis et al . , Molecular 
Cloning: A Laboratory Manual . 2nd Ed., (1989), Cold Spring 
Harbor, N.Y. and Berger and Kimmel, Methods in Enzymoloay. 

10 Volume 152. Guide to Molecular Cloning Techniques (1987) , 

Academic Press, Inc., San Diego, CA, which are incorporated 
herein by reference. 

Homocysteine -regulated polynucleotides may be short 
oligonucleotides (e.g., 20-100 bases long), such as for use as 

15 hybridization probes and PCR (or LCR) primers. These 

polynucleotide sequences may also comprise part of a larger 
polynucleotide (e.g., a cloning vector comprising a 
homocysteine -regulated protein clone) and may be fused, by 
polynucleotide linkage, in frame with another polynucleotide 

20 sequence encoding a different protein (e.g., glutathione S- 
transf erase or /?-galactosidase) for encoding expression of a 
fusion protein. Typically, homocysteine-regulated 
polynucleotides comprise at least 25 consecutive nucleotide 
which are substantially identical to a naturally-occurring 

25 homocysteine-regulated sequence (e.g., Fig. 5), more usually 
the polynucleotides comprise at least 50 to 100 consecutive 
nucleotide which are substantially identical to a naturally- 
occurring homocysteine-regulated sequence. However, it will 
be recognized by those of skill that the minimum length of a 

30 polynucleotide required for specific hybridization to a target 
sequence will depend on several factors: G/C content, 
positioning of mismatched bases (if any) , degree of uniqueness 
of the sequence as compared to the population of target 
polynucleotides, and chemical nature of the polynucleotide 

35 (e.g., methylphosphonate backbone, polyamide nucleic acid, 
phosphorothiolate, etc.), among others. 

If desired, PCR amplimers for amplifying 
substantially full-length cDNA copies may be selected at the 
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discretion of the practitioner. Similarly, amplimers to 
amplify single exons or portions of the gene (murine or human) 
may be selected. 

Each of these sequences may be used as hybridization 
5 probes or PCR amplimers to detect the presence of 

homocysteine -regulated mRNA, for example to diagnose a 
vascular disease characterized by the presence of an elevated 
or reduced homocysteine -regulated mRNA level in cells, or to 
perform tissue typing (i.e., identify tissues characterized by 

10 the expression of homocysteine -regulated mRNA), and the like. 
The sequences may also be used for detecting genomic 
homocysteine -regulated gene sequences in a DNA sample, such as 
for forensic DNA analysis (e.g., by RFLP analysis, PCR product 
length(s) distribution, etc.) or for diagnosis of diseases 

15 characterized by amplification and/or rearrangements of the 

g ene . Alternatively, homocysteine -regulated polynucleotides, 
including ESTs, can be used as a foodstuff, combustible energy 
source, viscosity- enhancing solute, and the like. In a 
variation of the invention, polynucleotides of the invention 

20 are employed for diagnosis of pathological conditions or 
genetic disease that involve atherosclerosis, metabolic 
disorders, or other medical conditions related to homocysteine 
levels, and more specifically conditions and diseases that 
involve alterations in the structure or abundance of a 

25 homocysteine -regulated mRNA or gene product. 
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Antisense Polynucleotides 
Additional embodiments directed to modulation of 
homocysteine metabolism or atherosclerosis include methods 
that employ specific antisense polynucleotides complementary 
5 to all or part of a predetermined homocysteine -regulated 

sequence, such as the sequences shown in Fig. 5) # or a full- 
length genomic sequence. Such complementary antisense 
polynucleotides may include nucleotide substitutions, 
additions, deletions, or transpositions, so long as specific 

10 hybridization to the relevant target sequence corresponding to 
the subject mRNA or gene (e.g., Fig. 5) is retained as a 
functional property of the polynucleotide. Complementary 
antisense polynucleotides include soluble antisense RNA or DNA 
oligonucleotides which can hybridize specifically to mRNA 

15 species and prevent transcription of the mRNA species and/or 
translation of the encoded polypeptide (Ching et al . (1989) 
Proc. Natl. Acad. Sci. U.S.A. 86 : 10006; Broder et al - (1990) 
Ann. Int. Med . 113 : 604; Loreau et al . (1990) FEBS Letters 
274 : 53; Holcenberg et al., W091/11535; U.S. S.N. 07/530,165;, 

20 WO91/09865; WO91/04753; WO90/13641; and EP 386563, each of 
which is incorporated herein by reference) . The antisense 
polynucleotides therefore inhibit production of homocysteine- 
regulated polypeptides. Antisense polynucleotides that 
prevent transcription and/or translation of mRNA corresponding 

25 to homocysteine-regulated mRNA species may treat 

hyperhompcysteinemia, atherosclerosis, vascular disease, 
metabolic disorders, senescence, kidney and liver disorders, 
and the like, and/or reverse a metabolic phenotype of cells. 
Antisense polynucleotides of various lengths may be produced, 

30 although such antisense polynucleotides typically comprise a 
sequence of about at least 25 consecutive nucleotides which 
are substantially identical to a naturally- occurring 
homocysteine -regulated polynucleotide sequence, and typically 
which are identical to a sequence shown in Fig. 5 or a 

3 5 homocysteine -regulated sequence disclosed herein. 

Antisense polynucleotides may be produced from a 
heterologous expression cassette in a transfectant cell or 
transgenic cell, such as a transgenic pluripotent 
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hematopoietic stem cell used to reconstitute all or part of 
the hematopoietic stem cell population of an individual. 
Alternatively, the antisense polynucleotides may comprise 
soluble oligonucleotides that are administered to the external 
milieu, either in the culture medium in vitro or in the 
circulatory system or interstitial fluid in vivo . Soluble 
antisense polynucleotides present in the external milieu have 
been shown to gain access to the cytoplasm and inhibit 
translation of specific mRNA species. In some embodiments the 
antisense polynucleotides comprise methylphosphonate moieties. 
For general methods relating to antisense polynucleotides, see 
Antisense RNA and DNA , (1988), D.A. Melton, Ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY) . 

Transgenic Animal Embodiments 
Genomic clones of a homocysteine-regulated gene, 
particularly of the human or mouse dhc-1 gene, may be used to 
construct homologous targeting constructs for generating cells 
and transgenic nonhuman animals having at least one 
functionally disrupted allele. Guidance for construction of 
homologous targeting constructs may be found in the art, 
including: Rahemtulla et al . (1991) Nature 353: 18 0; Jasin et 
al. (1990) Genes Devel . 4: 157; Koh et al . (1992) Science 256 : 
1210; Molina et al . (1992) Nature 357: 161; Grusby et al . 
(1991) Science 253 : 1417; Bradley et al . (1992) Bio /Technology 
10: 534, incorporated herein by reference) . Homologous 
targeting can be used to generate so-called "knockout" mice, 
which are heterozygous or homozygous for an inactivated 
allele. Such mice may be sold commercially as research 
animals for investigation of homocysteine homeostasis, amino 
acid metabolism, atherosclerosis, immune system development, 
neoplasia, aging, and the like; the nonhuman animals may be 
used as pets, may be used for animal protein (foodstuff), and 
other uses . 

Chimeric targeted mice are derived according to 

Hogan, et al . , Manipulating the Mnnae Embrvo: A Laboratory 

Manual , Cold Spring Harbor Laboratory (198 8) and 
Teratocarcinomafl and Embryonic Stem Cells: A Practical 
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Approach , E.J. Robertson, ed. , IRL Press, Washington, D.C., 
(1987) which are incorporated herein by reference. 
Embryonic stem cells are manipulated according to published 
procedures ( Teratocarcinomas and Embryonic Stem Cells: A 
5 Practical Approach , E.J. Robertson, ed. , IRL Press, 

Washington, D.C. (1987); Zjilstra et al . (1989) Nature 
342:435; and Schwartzberg et al . (1989) Science 246 : 799, each 
of which is incorporated herein by reference) . 

Additionally, a homocysteine -regulated cDNA or 

10 genomic gene copy may be used to construct transgehes for 
expressing polypeptides at high levels and/or under the 
transcriptional control of transcription control sequences 
which do not naturally occur adjacent to the homocysteine - 
regulated gene. For example but not limitation, a 

15 constitutive promoter (e.g., a HSV- tk or pgk promoter) or 
cell -lineage specific transcriptional regulatory sequence 
(e.g., an endothelial cell VEGF receptor promot er /enhancer ) 
may be operably linked to a homocysteine -regulated 
polynucleotide sequence to form a transgene (typically in 

20 combination with a selectable marker such as a neo gene 

expression cassette) . Such transgenes can be introduced into 
cells (e.g., ES cells, hematopoietic stem cells) and 
transgenic cells and transgenic nonhuman animals may be 
obtained according to conventional methods. Transgenic cells 

25 and/or transgenic nonhuman animals may be used to screen for 
ant i -atherosclerotic agents and/or to screen for potential 
pro- atherosclerotic compounds . 

Production of Homocvsteine-Reaulated Polypeptides 
30 The nucleotide and amino acid sequences obtained 

from the disclosed methods enable those of skill in the art to 
produce polypeptides corresponding to all or part of the full- 
length polypeptide sequences encoded by mRNA species which are 
regulated by homocysteine. Such polypeptides may be produced 
35 in prokaryotic or eukaryotic host cells by expression of 

polynucleotides encoding the protein, or fragments and analogs 
thereof. Alternatively, such polypeptides may be synthesized 
by chemical methods or produced by in vitro translation 
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systems using a polynucleotide template to direct translation. 
Methods for expression of heterologous proteins in recombinant 
hosts, chemical synthesis of polypeptides, and in vitro 
translation are well known in the art and are described 
5 further in Maniatis et al . , Molecular Clonincr: A Laboratory 
Manual (1989), 2nd Ed., Cold Spring Harbor, N.Y. and Berger 
and Kimmel, Methods in Etymology. Volume 152 . Guide to 
Molecular Cloning Techniques (1987), Academic Press, Inc., San 
Diego, CA. 

10 Fragments or analogs of a homocysteine-regulated 

protien may be prepared by those of skill in the art. 
Preferred amino- and carboxy- termini of fragments or analogs 
of occur near boundaries of functional domains. 

One method by which structural and functional 
15 domains may be identified is by comparison of the nucleotide 
and/or amino acid sequence data to public or proprietary 
sequence databases. Preferably, computerized comparison 
methods are used to identify sequence motifs or predicted 
protein conformation domains that occur in other proteins of 
20 known structure and/or function. For example, the NAD-binding 
domains of dehydrogenases, particularly lactate dehydrogenase 
and malate dehydrogenase, are similar in conformation and have 
amino acid sequences that are detectably homologous (Proteins, 
Structures and Molecular Principles , (1984) Creighton (ed.), 
25 W,H. Freeman and Company, New York, which is incorporated 

herein by reference) . Further, a method to identify protein 
sequences that fold into a known three-dimensional structure 
are known (Bowie et al. (1991) Science 253 : 164). Thus, the 
foregoing examples demonstrate that those of skill in the art 
3 0 can recognize sequence motifs and structural conformations 

that may be used to define structural and functional domains 
in the homocysteine-regulated sequences of the invention. 

Additionally, computerized comparison of sequences 
identified by the disclosed method, such as shown in Fig. 5, 
35 to existing sequence databases can identify sequence motifs 

and structural conformations found in other proteins or coding 
sequences that indicate similar domains of the homocysteine- 
regulated protein. For example but not for limitation, the 
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programs GAP, BESTFIT, FASTA , and TFASTA in the Wisconsin 
Genetics Software Package (Genetics Computer Group, 575 
Science Dr., Madison, WI) can be used to identify sequences in 
databases, such as GenBank/EMBL, that have regions of homology 
5 with a related sequences. Such homologous regions are 

candidate structural or functional domains. Examples include 
motifs present in other metabolic enzymes involved in amino 
acid metabolism. Alternatively, other algorithms are provided 
for identifying such domains from sequence data . Further, 

10 neural network methods, whether implemented in hardware or 
software, may be used to: (1) identify related protein 
sequences and nucleotide sequences, and (2) define structural 
or functional domains in homocysteine -regulated polypeptides 
(Brunak et al . (1991) J. Mol . Biol. 220 : 49, which is 

15 incorporated herein by reference) . 

Although one class of preferred embodiments are 
fragments having amino- and/or carboxy- termini corresponding 
to amino acid positions near functional domains borders, 
alternative fragments may be prepared. The choice of the amino - 

20 and carboxy- termini of such fragments rests with the 

discretion of the practitioner and will be made based on 
experimental considerations such as ease of construction, 
stability to proteolysis, thermal stability, immunological 
reactivity, amino- or carboxyl -terminal residue modification, 

25 or other considerations. 

In addition to fragments, analogs can be made. Such 
analogs may include one or more deletions or additions of 
amino acid sequence, either at the amino- or carboxy- termini , 
or internally, or both; analogs may further include sequence 

30 transpositions. Analogs may also comprise amino acid 
substitutions, preferably conservative substitutions. 
Additionally, analogs may include heterologous sequences 
generally linked at the amino- or carboxy- terminus, wherein 
the heterologous sequence (s) confer a functional property to 

3 5 the resultant analog which is not indigenous to the native 
homocysteine -regulated protein. However, analogs must 
comprise a segment of 25 amino acids that has substantial 
similarity to a portion of the amino acid sequences of a 
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mammalian homocysteine -regulated protein. Preferred amino 
acid substitutions are those which: (1) reduce susceptibility 
to proteolysis, (2) reduce susceptibility to oxidation, (3) 
alter post-translational modification of the analog, possibly 
including phosphorylation, and (4) confer or modify other 
physicochemical or functional properties of such analogs. 
Analogs include various muteins of a sequence other than the 
naturally-occurring peptide sequence. For example, single or 
multiple amino acid substitutions (preferably conservative 
amino acid substitutions) may be made in the naturally- 
occurring sequence . 

Conservative amino acid substitution is a 
substitution of an amino acid by a replacement amino acid 
which has similar characteristics (e.g., those with acidic 
properties: Asp and Glu) . A conservative (or synonymous) 
amino acid substitution should not substantially change the 
structural characteristics of the parent sequence (e.g., a 
replacement amino acid should not tend to break a helix that 
occurs in the parent sequence, or disrupt other types of 
secondary structure that characterizes the parent sequence) . 
Examples of art-recognized polypeptide secondary and tertiary 
structures are described in Proteins, stru ctures and Molecular 
Principles , (1984) Creighton (ed.), W.H. Freeman and Company, 
New York; Introduction to Protein Structure . (1991) , C. 
Branden and J. Tooze, Garland Publishing, New York, NY; and 
Thornton et al. (1991) Nature 354: 105; which are incorporated 
herein by reference) . 

Fusion proteins of homocysteine -regulated proteins 
can be made, such as fusions with a GAL4 activation domain or 
DNA-binding domain, and the like, such as for two-hybrid 

screening systems . 

Native homocysteine -regulated proteins, fragments 
thereof, or analogs thereof can be used as reagents in binding 
assays for identifying agents that interfere with homocysteine 
metabolism. 

Production and Applications of Antibodies 
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Native homocysteine -regulated proteins, fragments 
thereof, or analogs thereof, may be used to immunize an animal 
for the production of specific antibodies. These antibodies 
may comprise a polyclonal antiserum or may comprise a 
5 monoclonal antibody produced by hybridoma cells. For general 
methods to prepare antibodies, see Antibodies: A Laboratory 
Manual , (1988) E. Harlow and D. Lane, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, which is incorporated 
herein by reference. 

10 For example but not for limitation, a recombinant ly 

produced fragment of a homocysteine-regulated protein can be 
injected into a mouse along with an adjuvant following 
immunization protocols known to those of skill in the art so 
as to generate an immune response. Typically, approximately 

15 at least 1-50 fig of a homocysteine-related protein fragment or 
analog is used for the initial immunization, depending upon 
the length of the polypeptide. Alternatively or in 
combination with a recombinantly produced polypeptide, a 
chemically synthesized peptide having a homocysteine -regulated 

20 sequence may be used as an immunogen to raise antibodies which 
bind a homocysteine -regulated protein. Immunoglobulins which 
bind the recombinant fragment with a binding affinity of at 
least 1 x 10 7 NT 1 can be harvested from the immunized animal as 
an antiserum, and may be further purified by immunoaf f inity 

25 chromatography or other means. Additionally, spleen cells are 
harvested from the immunized animal (typically rat or mouse) 
and fused to myeloma cells to produce a bank of antibody- 
secreting hybridoma cells. The bank of hybridomas can be 
screened for clones that secrete immunoglobulins which bind 

3 0 the recombinantly-produced polypeptide (or chemically 

synthesized polypeptide) with an affinity of at least 1 x 10 6 
M" 1 . Animals other than mice and rats may be used to raise 
antibodies; for example, goats, rabbits, sheep, and chickens 
may also be employed to raise antibodies reactive with a 

3 5 homocysteine -regulated protein. Transgenic mice having the 

capacity to produce subs t ant i ally human antibodies also may be 
immunized and used for a source of homocysteine -regulated 
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protein antiserum and/or for making monoclonal-secreting 
hybridomas . 

Bacteriophage antibody display libraries may also be 
screened for binding to a homocysteine-regulated polypeptide, 
5 such as a full-length protein, a fragment, or a fusion protein 
comprising a homocysteine-regulated polypeptide sequence. 
Combinatorial libraries of antibodies have been 
generated in bacteriophage lambda expression systems which may 
be screened as bacteriophage plaques or as colonies of 

10 lysogens (Huse et al . (1989) Science 246 : 1275; Caton and 
Koprowski (1990) Proc. Natl. Acad. Sci . (U.S.A.) 87: 6450; 
Mullinax et al (1990) Proc. Natl. Acad. Sci. (U.S.A. ) 87: 
8095; Persson et al . (1991) Proc. Natl. Acad. Sci. (U.S.A.) 
88.: 2432) . Various embodiments of bacteriophage antibody 

15 display libraries and lambda phage expression libraries have 
been described (Kang et al. (1991) Proc. Na tl. Acad. Sci. 
(U.S.A.) 88 : 4363; Clackson et al • (1991) Nature 3 52: 624; 
McCafferty et al . (1990) Nature 34 8 : 552; Burton et al . (1991) 
Proc. Natl. Acad. Sci. (U.S.A. ) 88: 10134; Hoogenboom et al. 

20 (1991) Nucleic Acids Res. 19: 4133; Chang et al . (1991) 

Immunol . 147: 3610; Breitling et al . (1991) gene 104: 147; 
Marks et al . (1991) J. Mol . Biol. 222 : 581; Barbas et al . 
(1992) Proc. Natl. Acad. Sci. (U.S.A.) 89: 4457; Hawkins and 
Winter (1992) J . Immunol . 22 : 867; Marks et al . (1992) 

25 Biotechnology 10: 779; Marks et al, (1992) J. Biol. Chem. 267 : 
16007; Lowman et al (1991) Biochemistry 30,: 10832; Lerner et 
al. (1992) Science 258 : 1313, incorporated herein by 
reference) . Typically, a bacteriophage antibody display 
library is screened with a homocysteine-regulated polypeptide 

30 that is immobilized (e.g., by covalent linkage to a 

chromatography resin to enrich for reactive phage by affinity 
chromatography) and/or labeled (e.g., to screen plaque or 
colony lifts) . 

Homocysteine-regulated polypeptides which are useful 

3 5 as immunogens, for diagnostic detection of antibodies in a 
sample, for diagnostic detection and quantitation of a 
homocysteine-regulated protein in a sample (e.g., by 
standardized competitive EL.ISA) , or for screening a 
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bacteriophage antibody display library, are suitably obtained 
in substantially pure form, that is, typically about 50 
percent (w/w) or more purity, substantially free of 
interfering proteins and contaminants. Preferably, these 
5 polypeptides are isolated or synthesized in a purity of at 

least 80 percent (w/w) and, more preferably, in at least about 
95 percent (w/w) purity, being substantially free of other 
proteins of humans, mice, or other contaminants. 

For some applications of these antibodies, such as 

10 identifying immunocrossreactive proteins, the desired 

antiserum or monoclonal antibody (ies) is/are not monospecific. 
In. these instances, it may be preferable to use a synthetic or 
recombinant fragment of a protein as an antigen rather than 
using the entire native protein. More specifically, where the 

15 object is to identify immunocrossreactive polypeptides that 

comprise a particular structural moiety, such as a DNA-binding 
domain, it is preferable to use as an antigen a fragment 
corresponding to part or all of a commensurate structural 
domain in the protein. ( 

20 

Yeast Two -Hybrid Screening Assays 
An approach to identifying polypeptide sequences 
which bind to a predetermined polypeptide sequence has been to 
use a so-called "two-hybrid" system wherein the predetermined 

25 polypeptide sequence is present in a fusion protein (Chien et 
al. (1991) Proc. Natl. Acad. Sci. (USA) 88 : 9578) . This 
approach identifies protein-protein interactions .in vivo 
through reconstitution of a transcriptional activator (Fields 
S and Song O (1989) Nature 340 : 245), the yeast Gal4 

30 transcription protein. Typically, the method is based on the 
properties of the yeast Gal4 protein, which consists of 
separable domains responsible for DNA-binding and 
transcriptional activation. Polynucleotides encoding two 
hybrid proteins, one consisting of the yeast Gal4 DNA-binding 

35 domain fused to a polypeptide sequence of a known protein and 
the other consisting of the Gal4 activation domain fused to a 
polypeptide sequence of a second protein, are constructed and 
introduced into a yeast host cell. Intermolecular binding 
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between the two fusion proteins reconstitutes the Gal4 DNA- 
binding domain with the Gal4 activation domain, which leads to 
the transcriptional activation of a reporter gene (e.g., IslcZ, 
HIS3) which is operably linked to a Gal4 binding site. 
Typically, the two-hybrid method is used to identify novel 
polypeptide sequences which interact with a known protein 
(Silver SC and Hunt SW (1993) Mol . Biol. Rep. 17: 155; Durfee 
et al. (1993) Genes Devel. 7; 555; Yang et al . (1992) Science 
257 : 680; Luban et al . (1993) Cell 73: 1067; Hardy et al . 
(1992) Genes Devel. 6; 801; Bartel et al . (1993) Biotechniques 
14: 920; and Vojtek et al. (1993) Cell 74: 205) . However, 
variations of the two-hybrid method have been used to identify 
mutations of a known protein that affect its binding to a 
second known protein (Li B and Fields S (1993) FASEB J. 7: 

957; Lalo et al. (1993) Proc . Nat l. Acad. Sci . (USA) 90: 5524; 

Jackson et al . (1993) Mol. Cell. Biol. 13; 2899; and Madura et 
al. (1993) J ■ Biol. Chem. 268 : 12046). Two-hybrid systems 
have also been used to identify interacting structural domains 
of two known proteins (Bardwell et al . (1993) tried. Migrobiol. 
8: 1177; Chakraborty et al . (1992) J. Biol. Chem. 267: 17498; 
Staudinger et al . (1993) J. Biol. Chem. 268: 4608; and Milne 
GT and Weaver DT (1993) Genes Devel. 7; 1755) or domains 
responsible for oligomerization of a single protein (Iwabuchi 
et al. (1993) Oncogene 8; 1693; Bogerd et al . (1993) J. Virol. 
67: 5030) Variations of two-hybrid systems have been used to 
study the in vivo activity of a proteolytic enzyme 

(Dasmahapatra et al . (1992) Proc. Na tl. Acad. Sci. (USA) 89: 

4159) . Alternatively, an E. coli/BCCP interactive screening 
system (Germino et al . (1993) Proc. Natl. Acad. Sci. (U.S.A.) 

90: 933; Guarente L (1993) Proc. Na tl. Acad. Sci. (U.S.A. ) £0: 

163 9) can be used to identify interacting protein sequences 
(i.e., protein sequences which heterodimerize or form higher 
order heteromultimers) . 

Each of these two-hybrid methods rely upon a 
positive association between two Gal4 fusion proteins thereby 
reconstituting a functional Gal4 transcriptional activator 
which then induces transcription of a reporter gene operably 
linked to a Gal4 binding site. Transcription of the reporter 
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gene produces a positive readout, typically manifested either 
(1) as an enzyme activity (e.g., 0-galactosidase) that can be 
identified by a colorimetric enzyme assay or (2) as enhanced 
cell growth on a defined medium (e.g., HIS3) . A positive 
5 readout condition is generally identified as one or more of 
the following detectable conditions: (1) an increased 
transcription rate of a predetermined reporter gene, (2) an 
increased concentration or abundance of a polypeptide product 
encoded by a predetermined reporter gene, typically such as an 

10 enzyme which can be readily assayed in vivo , and/or (3) a 

selectable or otherwise identifiable phenotypic change in an 
organism (e.g., yeast) harboring the reverse two-hybrid 
system. Generally, a selectable or otherwise identifiable 
phenotypic change that characterizes a positive readout 

15 condition confers upon the organism either: a selective growth 
advantage on a defined medium, a mating phenotype, a 
characteristic morphology or developmental stage, drug 
resistance, or a detectable enzymatic activity (e.g., /?- 
galactosidase, lucif erase, alkaline phosphatase, and the 

20 like) . 

Transcriptional activators are proteins that 
positively regulate the expression of specific genes. They 
can be functionally dissected into two structural domains : 
one region that binds to specific DNA sequences and thereby 

25 confers specificity, and another region termed the activation 
domain that binds to protein components of the basal gene 
expression machinery (Ma and Ptashne (1988) Cell 55.: 443) . 
These two domains need to be physically connected in order to 
function as a transcriptional activator. Two-hybrid systems 

3 0 exploit this finding by hooking up an isolated DNA binding 
domain to one protein (protein X) , while hooking up the 
isolated activation domain to another protein (protein Y) . 
When X and Y interact to a significant extent, the DNA binding 
and activation domains will now be connected and the 

35 transcriptional activator function reconstituted (Fields and 
Song (1989) Nature 340 : 245) . The yeast host strain is 
engineered so that the reconstituted transcriptional activator 
drives the expression of a specific reporter gene such as HIS3 
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or lacZ, which provides the read-out for the protein-protein 
interaction (Field and Song (1989) op.cit . ; Chein et al . 
(19 91) op.cit . ) . One advantage of two -hybrid systems for 
monitoring protein-protein interactions is their sensitivity 
in detection of physically weak, but physiologically 
important, protein-protein interactions. As such it offers a . 
significant advantage over other methods for detecting 
protein-protein interactions (e.g., EL ISA assay). 

The invention also provides host organisms 
(typically unicellular organisms) which harbor a homocysteine - 
regulated protein two-hybrid system, typically in the form of 
polynucleotides encoding a first hybrid protein, a second 
hybrid protein, and a reporter gene, wherein said 
polynucleotide (s) are either stably replicated or introduced 
for transient expression. In an embodiment, the host organism 
is a yeast cell (e.g., Saccharomvces cervisiae ) and in which 
the reporter gene transcriptional regulatory sequence 
comprises a Gal4 -responsive promoter. 

Yeast comprising (1) an expression cassette encoding 
a GAL4 DNA binding domain (or GAL4 activator domain) fused to 
a binding fragment of a homocysteine-regulated protein (2) an 
expression cassette encoding a GAL4 DNA activator domain (or 
GAL4 binding domain, respectively) fused to a member of a cDNA 
library, and (3) a reporter gene (e.g., /3-galactosidase) 
comprising a cis-linked GAL4 transcriptional response element 
can be used for screening for cDNA sequences encoding 
polypeptides which bind to the homocysteine-regulated protein 
with high affinity. 

Yeast two -hybrid systems may be used to screen a 
mammalian (typically human) cDNA expression library, wherein 
cDNA is fused to a GAL4 DNA binding domain or activator 
domain, and a homocysteine-regulated polypeptide sequence is 
fused to a GAL4 activator domain or DNA binding domain, 
respectively. Such a yeast two-hybrid system can screen for 
cDNAs that encode proteins which bind to homocysteine- 
regulated sequences. For example, a cDNA library can be 
produced from mRNA from a human mature B cell (Namalwa) line 
(Ambrus et al . (1993) Proc . Na tl- Acad. Sci , (U.S.A.)) or 
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other suitable cell type. Such a cDNA library cloned in a 
yeast two-hybrid expression system (Chien et al . (1991) Proc. 
Natl. Acad. Sci. (U.S.A. ) 88 : 9578) can be used to identify 
cDNAs which encode proteins that interact with a homocysteine - 
5 regulated protein and thereby produce expression of the GAL4- 
dependent reporter gene. Polypeptides which interact with a 
homocysteine -regulated protein can also be identified by 
immunoprecipitation of the homocysteine -regulated protein with 
antibody and identification of co-precipitating species. 

10 Further, polypeptides that bind a homocysteine -regulated 
protein can be identified by screening a peptide library 
(e.g., a bacteriophage peptide display library, a spatially 
defined VLSIPS peptide array, and the like) with a 
homocysteine -regulated polypeptide . 

15 Once such cDNAs encoding such interacting 

polypeptides are identified, they may be used for screening a 
bank of compounds (e.g., small molecule libraries) to identify 
agents which inhibit the binding interaction. Yeast 
comprising (1) an expression cassette encoding a GAL4 DNA 

20 binding domain (or GAL4 activator domain) fused to a binding 
fragment of homocysteine -regulated protein (2) an expression 
cassette encoding a GAL4 DNA activator domain (or GAL4 binding 
domain, respectively) fused to the selected cDNA encoding the 
interacting protein, and (3) a reporter gene (e.g., /3- 

25 galactosidase) comprising a cis-linked GAL4 transcriptional 
response element can be used for screening for agents which 
inhibit the cDNA-encoded polypeptide from binding to the 
homocysteine -regulated protein with high affinity. Such yeast 
are incubated with a test agent and expression of the reporter 

30 gene (e.g., /3-galactosidase) is determined; the capacity of 
the agent to inhibit expression of the reporter gene as 
compared to a control culture identifies the agent as a 
candidate atherosclerosis modulating agent. 

Administration of an efficacious dose of an agent 

3 5 capable of specifically inhibiting gene expression related to 
hyperhomocystenemia to a patient can be used as a therapeutic 
or prophylactic method for treating pathological conditions 
(e.g., hyperhomocysteinemia, atherosclerosis, and the like) 
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which are effectively treated by modulating cellular 
responsiveness to homocysteine. 

Methods for Forensic Identification 
The homocysteine -regulated polynucleotide sequences of 
the present invention can be used for forensic identification 
of individual humans, such as for identification of decedents, 
determination of paternity, criminal identification, and the 
like. For example but not limitation, a DNA sample can be 
obtained from a person or from a cellular sample (e.g., crime 
scene evidence such as blood, saliva, semen, blood-stained 
gloves, blood spots on the door of a white Ford Bronco, and 
the like) and subjected to RFLP analysis, allele-specif ic PCR, 
or PCR cloning and sequencing of the amplification product to 
determine the structure of the homocysteine-regulated gene 
region. On the basis of the homocysteine-regulated gene 
structure, the individual from which the sample originated 
will be identified with respect to his/her homocysteine- 
regulated protein genotype. The homocysteine-regulated 
protein genotype may be used alone or in conjunction with 
other genetic markers to conclusively identify an individual 
or to rule out the individual as a possible perpetrator. 

In one embodiment, human genomic DNA samples from a 
population of individuals (typically at least 50 persons from 
various racial origins) are individually aliquoted into 
reaction vessels (e.g., a well on a microtitre plate) . Each 
aliquot is digested (incubated) with one or more restriction 
enzymes (e.g., EcoRI, Hindlll, Smal, BamHI, Sail, NotI, AccI, 
Apal, Bglll, Xbal, PstI) under suitable reaction conditions 
(e.g., see New England Biolabs 1995 catalog) . Corresponding 
digestion products from each individual are loaded separately 
on an electrophoretic gel (typically agarose) , 
electrophoresed, blotted to a membrane by Southern blotting, 
and hybridized with a labeled homocysteine-regulated probe 
(e.g., a Fig. 5). Restriction fragments (bands) which are 
polymorphic among members of the population are used as a 
basis to discriminate homocysteine-regulated protein genotypes 
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and thereby classify individuals on the basis of their 
homocysteine -regulated protein genotype . 

Similar categorization of homocysteine -regulated 
genotypes may be performed by sequencing PCR amplification 
5 products from a population of individuals and using sequence 
polymorphisms to identify alleles (genotypes) , and thereby 
identify or classify individuals. 

The invention also provides homocysteine -regulated 
polynucleotide probes for diagnosis of disease states (e.g., 

10 hyperhomocysteinemia, atherosclerosis) by detection of a 

homocysteine -regulated mRNA in cells explanted from a patient, 
or detection of a pathognomonic homocysteine -regulated gene 
allele (e.g., by RFLP or allele-specif ic PCR analysis) which 
predisposes to hyperhomocysteinemia or atherosclerosis . 

15 Typically, the detection will be by in situ hybridization 
using a labeled (e.g., 32 P, 35 S, 14 C, 3 H, fluorescent, 
biotinylated, digoxigeninylated) homocysteine -regulated 
polynucleotide, although Northern blotting, dot blotting, or 
solution hybridization on bulk RNA or poly A" RNA isolated .,- 

20 from a cell sample may be used, as may PCR amplification using 
homocysteine -regulated protein specific primers. Cells which 
contain an altered amount of a homocysteine-regulated mRNA as 
compared to normal cells of the same cell type(s) will be 
identified as originating from disease candidates. Similarly, 

2 5 the detection of pathognomonic rearrangements or amplification 

of the homocysteine-regulated gene locus or closely linked 
loci in a cell sample will identify the presence of a 
pathological condition or a predisposition to developing a 
genetically linked pathological condition (e.g., cancer, 

3 0 genetic disease) . The polynucleotide probes are also used for 

forensic identification of individuals, such as for paternity 
testing or identification of criminal suspects (e.g., O.J. 
Simpson) or unknown decedents. 



35 



Homocysteine -Reponsive Transcription Elements 



A transcriptional regulatory sequence can be 
isolated from an endogenous homocysteine-regulated gene, 
typically by genomic DNA cloning. In such genes, the 
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transcriptional regulatory sequence is at least the minimal 
sequence (s) required for efficient homocysteine -induced or 
homocysteine -repressed expression, which generally is at least 
a promoter and at least about 0.2 kilobase (kb) upstream of 
5 the promoter, preferably at least about 1 to 3 kb upstream of 
the promoter, more preferably at least about 5 kb upstream of 
the promoter, and frequently at least about 8 or more kb 
upstream of the promoter. Frequently, sequences downstream of 
the promoter, especially intronic sequences, are included. 

10 Reporter constructs can be made. Usually the sequences 
upstream of the promoter are used contiguously, although 
various deletions and rearrangements can be employed. Some 
desired regulatory elements {e.g., enhancers, silencers) may 
be relatively position- insensitive, so that the regulatory 

15 element will function correctly even if positioned differently 
in a reporter construct than in the corresponding germline 
gene. For example, an enhancer may be located at a different 
distance from a promoter, in a different orientation, and/or 
in a different linear order. For example, an enhancer that is 

20 located 3' to a promoter in germline configuration might be 
located 5' to the promoter in a reporter construct. Where 
convenient, it is preferred that a contiguous segment of 
genomic DNA sequence spanning the homocysteine-regulated gene 
and containing as much upstream flanking sequence as 

25 convenient {typically at least about 1-10 kb) be used in the 

reporter construct or a gene therapy construct. * It is further 
recognized that a homocysteine-regulated gene may comprise 
multiple promoters, which may individually be cell -specif ic . 

30 Diagnostic and Therapeutic Applications 

A change (typically an increase) in the level of a 
homocysteine-regulated mRNA in cells of in a biological sample 
from an individual which is outside the range of clinically 
established normal levels indicates the presence of an active 

35 disease or condition (e.g., hyperhomocysteinemia, 

atherosclerosis, and the like) in the individual from whom the 
sample was obtained and/or indicates a predisposition of the 
individual for developing (or progressing through) the 
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disease. Thus, detection of a pathognomonic concentration 
(relative abundance in an mRNA pool, such as may be 
standardized to GAPDH) of a homocysteine- regulated mRNA in a 
patient sample, such as a blood, biopsy, serum, saliva, or 
buccal cell sample, is an indicator of hyperhomocysteinemia 
and atheroscleotic disease. 

Further, a homocysteine -regulated mRNA may be used 
as a differentiation marker to identify and type cells of 
certain lineages and developmental origins. Such cell-type 
specific detection may be used for histopathological diagnosis 
of neoplasms (e.g., for diagnosing the cell type and/or 
differentiation state of a primary tumor cell) or other 
applications (e.g., localizing imaging or toxic agents to 
specific locations for magnetic imaging or radioimaging in 
vivo or for cytotoxic effect) . Such agents may include, for 
example, a linked component comprising: metals, 
chemotherapeutic drugs, radiosensitizing agents, cellular 
toxins, radionuclides, and others. 

Various other applications of such agents that 
specifically detect a homocysteine -regulated mRNA or the 
encoded protien are apparent to those of skill in the art and 
may be developed. 

Detection of Homocvsteine-Requlated Genes Samples 
The identification and measurement of a 
homocysteine -regulated mRNA or the encoded protein in a 
biological sample can be performed by one or more assay 
methods of the invention. The general scheme of such an assay 
method requires that a biological sample is obtained from a 
human patient. The biological sample is typically a cellular 
sample, such as blood, tissue biopsy material, or buccal 
scapings, may be used. 

Antibody Embodiment 
Although many variations of the invention are 
possible, including assays using a single antibody that is 
essentially monospecific for a homocysteine -regulated protien 
and competitive immunoassays, a two-component sandwich binding 
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assay is believed to be generally preferable. In one 
embodiment of a sandwich binding assay, an immobilized binding 
component binds to a site on a homocysteine -regulated protein 
and a soluble labeled binding component binds to a second site 
5 on the homocysteine -regulated protien, unbound components are 
removed (typically by washing with a suitable buffered 
solution and/or centrif ugation) , and the amount of labeled 
binding component that remains immobilized via linkage through 
the analyte (i.e., the homocysteine-regulated protein) is 

10 quantitatively measured to determine the amount of 

homocysteine-regulated protein present in the sample. Some 
sandwich assays require addition of immobilized binding 
component and analyte (i.e., sample) followed by separation of 
bound and unbound analyte, then followed by addition of 

15 soluble labeled binding component. 

Using the methods disclosed herein, others skilled 
in the art can readily produce suitable immunological 
reagents. Moreover, methods (Kohler and Milstein (1976) Eur. 
J. Immunol. 6: 511, incorporated herein by reference) are 

20 known for producing a monoclonal antibody, such as one that 
binds to a a homocysteine-regulated protein with an affinity 
of typically at least about 1 x 10 6 M" 1 , preferably at least 
about 1 x 10 B M" 1 , and more preferably at least about 1 xlO 9 to 
1 x 10 11 M" 1 or more, and which has a binding affinity for a 

25 non-specific proteins of at least about 10-fold less, 

preferably at least about 100- to 1000-fold less, with a 
sufficient difference in binding affinity between the 
homocysteine-regulated protein and the remaining protein pool 
so as to be a specific binding component. In general terms, 

30 such a method comprises the steps of: (1) immunizing an animal 
with a composition containing an immunogenic dose of a 
homocysteine-regulated protein (or a fragment thereof) , 
usually at least about 50 to 500ug of purified material, 
typically co- administered with an adjuvant, (2) harvesting B- 

35 cells from the animal, (3) fusing the B-cells with a myeloma 
to generate a bank of hybridoma clones, and (4) selecting, 
from the bank, a hybridoma clone that expresses a monoclonal 
antibody which binds to the homocysteine-regulated protein 
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with an affinity of at least about 1 x 10 7 NT 1 and which has a 
binding affinity for nonfat milk and calf serum of at least 
about 10- to 1000-fold lower. 

Additionally, a method for producing a polyclonal 
5 antiserum that specifically or preferentially binds to a 

homocysteine-regulated protein is described. In general, such 
a polyclonal antiserum will have an affinity for the 
homocysteine-regulated protein of at least 1 x 10 e M' 1 and 
which has a binding affinity for nonfat milk solids of at 

10 least about 10- to 1000-fold lower, comprising the steps of: 
(1) immunizing an animal with a composition containing an 
immunogenic dose of a homocysteine-regulated protein or a 
fragment thereof (and typically an adjuvant) , (2) obtaining a 
polyclonal antiserum from said animal, (3) preadsorbing said 

15 polyclonal antiserum with a saturating amount of a nonfat milk 
solids or serum, and (4) recovering antibodies that do not 
substantially bind to the nonfat milk solids or serum. 

Detection of bound "sandwich" complexes typically is 

2 0 performed by detecting the presence of bound labeled binding 

component. Usually, the labeled binding component comprises a 
direct label, such as a fluorescent label, radioactive label, 
or enzyme -conjugated label that catalyzes the conversion of a 
chromogenic substrate to a chromophore. However, it is 

25 possible, and often desirable for signal amplification, for 
the labeled binding component to be detected by at least one 
additional binding component that incorporates a label. 
Signal amplification can be accomplished by layering of 
reactants where the reactants are polyvalent. Direct labeling 

30 of binding components may be accomplished by various methods 
known in the art, including: covalent attachment of various 
enzymes (e.g., horseradish peroxidase, 0-galactosidase , and 
alkaline phosphatase) , fluorescent molecules (fluorescein 
isothiocyanate, rhodamine, dansyl chloride) , and radioactive 

3 5 materials ( 3 H, 14 C, 32 P, 35 S, 125 I, 131 I) which may be incorporated 

as labeled amino acid residues or post-translationally labeled 
(e.g., Chloramine T method) . 
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Diagnostic Kits 
The invention also encompasses diagnostic kits that 
typically contain the assay reagents in a prepackaged form. 
The assay components will typically include homocysteine - 
5 regulated protein binding component, typically a specifically 
reactive antibody, and a labeled component (which may be 
attached or incorporated into the homocysteine -regulated 
protein binding component or may be provided separately) f a 
suitable reaction vessel (e.g., a microtiter well, a polymer 

10 membrane), and may also include diluent (s), assay buffer(s), 
wash buffer (s), and development reagents (e.g., chromogenic 
substrate for enzyme-linked assays) . The various assay 
components are usually provided in aqueous solution, but may 
be provided in lyophilized form and reconstituted for 

15 performing the assay. Similarly, automated devices may be 
developed to perform assays of the invention. 

The following examples are given to illustrate the 
invention, but are not to be limiting thereof . All 
percentages given throughout the specification are based upon 

20 weight unless otherwise indicated. All protein molecular 
weights are based on mean average molecular weights unless 
otherwise indicated . 

EXPERIMENTAL EXAMPLES 

25 Overview 

At present, there is no simple, reliable and/or 
quick test to identify individuals with hyperhomocysteinemia. 
In addition, plasma homocysteine levels in individuals 
heterozygous for cystathionine 0-synthase (the major inherited 

3 0 form of hyperhomocysteinemia) can overlap with those of 

healthy individuals, thus complicating the identification of 
those at risk for premature atherosclerosis. In addition, 
strategies to monitor and improve this condition are lacking 
due to the poor understanding of the mechanisms responsible 

35 for homocysteine -induced vascular disease. The examples 

identify homocysteine- induced genes and their gene products 
for the successful diagnosis and treatment of homocysteine - 
induced vascular disease. 
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mRNA differential display (Liang and Pardee, 1992; 
Liang et al . 1993) was used to identify specific homocysteine- 
induced genes in human umbilical vein endothelial cells 
(HLJVECs) and the human megakaryocytic cell line Dami exposed 
5 to homocysteine. Briefly, mRNA differential display patterns 
from HLTVECs and Dami cells exposed to homocysteine were 
compared on 6% sequencing gels to identify differentially 
expressed cDNA fragments (Fig. 1) . Homocysteine- induced cDNA 
fragments were defined as those fragments that were 

10 specifically and reproducibly up-regulated on differential 
display gels. Approximately 10-20 cDNA fragments have been 
identified in HUVECs and Dami cells as homocysteine- induced 
genes. One of these, designated dhc-1, was consistently 
induced in Dami cells treated with homocysteine (Fig. 2) . To 

15 confirm the homocysteine -induced up-regulation pattern 

observed on the differential display gels, the selected cDNA 
fragments, including dhc- 1, were recovered, reamplified and 
used to probe Northern blots prepared with total RNA isolated 
from HUVECs and Dami cells treated in the presence or absence 

20 of homocysteine, dhc-2 was shown to specifically hybridize to 
a 1.8 kb transcript on Northern blots that reproduced the 
homocysteine -specif ic increase in expression (Fig. 3) . dhc-1 
was also shown to be up-regulated, in a dose -dependent mannier, 
in HUVECs exposed to increasing concentrations of homocysteine 

25 (Fig. 4) . Cloned cDNA fragments that showed homocysteine- 
dependent up-regulation on Northern blots were subsequently 
cloned into pBluescript (Clontech, Palo Alto, CA) and 
partially sequenced using T3 or T7 sequencing primers. The 
partial nucleotide sequence of dhc-1 is shown in Figure 5. 

3 0 Searching of the GenBanJc database revealed that dhc -I has 96% 
homology to a novel human CDNA, HHCJ64 (accession #M62278) . 
The function of this gene is not known, however, its origin 
may be mitochondrial. This same approach, from identifying 
homocysteine -induced cDNA fragments on differential gels to 

3 5 searching GenBank for homology to other genes, is being used 

to characterize the additional homocysteine -induced genes that 
have been identified in HUVECs and Darni cells. 
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Thus, the example indicates successful 
identification of specific homocysteine -induced genes 
expressed in endothelial cells and megakaryocytes. These 
sequences are be used to successfully identify individuals 
5 predisposed to elevated levels of blood homocysteine. 

Secondly, it provides a means of identifying whether or not 
treatments which are supposedly known to decrease homocysteine 
levels actually result in the reduction of endothelial cell 
injury which is responsible for the atherosclerotic effects of 

10 homocysteine. In addition, by identifying these homocysteine- 
induced genes and their gene products it is possible to better 
understand the mechanisms involved in homocysteine -induced 
vascular disease. Furthermore, these genes and their gene 
products allow detection and improve the outcome of premature 

15 vascular disease and thrombosis, regardless of its origin. 

EXAMPLE 1 

mRNA differential display, followed by Northern blot 
analysis was used to identify homocysteine- induced genes 

20 expressed in HUVECs and Dami cells. HUVECs were used in these 
studies because previous data has shown that homocysteine is 
cytotoxic to the endothelium. In addition, endothelial damage 
followed by vascular occlusion are common and are the usual 
cause of premature death in patients homozygous for 

25 cystathionine /3- synthase. Dami cells, a megakaryocyte cell 
line were used because of the ease at which platelets can be 
isolated and analyzed. Thus, specific homocysteine -induced 
genes expressed in megakaryocytes are detected by isolating 
platelets. The method of differential display involves the 

30 reverse transcription of mRNAs with oligo-dT primers 

containing additional nucleotides which anchor the primer to 
the beginning of the poly (A) tail, followed by a PCR reaction 
in the presence of an arbitrary 10- I 5mer and the original 
anchor primer. The amplified cDNA subpopulations are then 

3 5 separated on denaturing sequencing gels to identify 

homocysteine- induced cDNAs . Once these cDNAs are shown to be 
reproducibly up-regulated on differential display gels, they 
are removed from the gel, PCR-amplif ied and used as probes on 
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Northern blots to confirm their homocysteine- induced 
expression. These homocysteine -induced cDNA fragments are 
then sequenced and compared to the GenBank databases to 
determine if these cDNAs are novel or have been identified 
previously. The cDNA fragments of interest can also be used 
as probes to screen appropriate cDNA libraries for the 
complete gene of interest. Once cloned, the open reading 
frame of the cDNAs of interest can be expressed as recombinant 
fusion proteins in bacteria and used as immunogens for the 
production of specific polyclonal or monoclonal antibodies 
directed to the specific gene product. As a result, 
homocysteine -induced expression of these gene products can be 
easily determined. The Materials and Methods used to identify 
these homocysteine -induced genes from HUVECs and Datni cells 
are described in detail below: 
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Cell Culture 

Dami cells were obtained from the American Type 
Culture Collection (ATCC) (Rockville, MD) and cultured in 75 
ml flasks in RPMI 1640 medium supplemented with 5% heat-- 
inactivated fetal bovine serum, 1 00 U/ml penicillin and 100 
/ig/ml streptomycin. Endothelial cells were obtained from 
fresh human umbilical veins and routinely cultured in 75 ml 2 
flasks coated with human fibronectin (5 fig/am 2 ) in M199 medium 
containing 10% fetal calf serum, 100 IU penicillin, 100 /xg/ml 
streptomycin, 100 fig/ml heparin and 30 /zg/ml endothelial cell 
growth supplement. Cultures were maintained at 37°C in a 
humidified atmosphere of 5% C0 2 in air. DL-homocysteine 
(Sigma, St. Louis, MO) was prepared in sterile water, filtered 
and added to 60% confluent cell cultures at the desired 
concentration (0.1 to 5 mM) . After incubation for 18 hr with 
control or homocysteine-containing media, cells were washed 
twice with sterile PBS prior to RNA isolation. 

Total RNA Isolation and Purification 

All solutions were prepared with water that had been 
treated with 0.1% diethyl pyrocarbonate (DEPC) and autoclaved 
prior to total RNA isolation. Total RNA was isolated from 
cultured cells using the RNeasy total RNA kit as described by 
the manufacturer (Qiagen, Chatsworth, CA) and resuspended in 
DEPC- treated water. DNA contamination was removed by 
incubating the samples with 2 U of RQ DNase (Promega, Madison, 
WI) in the presence of 12 U of Rnasin (Promega) for 3 0 min at 
37°C. After the addition of DEPC-treated water to 100 /il # the 
samples were mixed with an equal volume of phenol/chloroform 
(3:1), centrifuged and the aqueous phase transferred to an 
RNase-free 1 . 7 ml centrifuge tube. After the addition of 10 
jil of 3 M sodium acetate and 3 00 /a1 of absolute ethanol, the 
RNA was precipitated overnight at -80°C. The RNA was 
collected by centrif ugation at 14,000 X g for 10 min at 40°C, 
washed briefly with 85% ethanol, dried and resuspended in 
DEPC-treated water. The A260/280 ratio was determined and 
used to calculate the amount and purity of each RNA sample. 
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The RNA samples were subsequently resuspended in 10 /xg 
aliquots and stored at -80°C* 



mRNA Differential Display 
5 After removal of DNA contamination from the total 

RNA samples, differential display was carried out using the 
RNA map and image kits (GenHunter, Brookline, MA) as described 
by the manufacturer. The cDNAs were amplified by PCR in the 
presence of a-S 35 dATP (specific activity > 1000 Ci/mnol; NEN, 

10 Toronto, ON) using a Perkin-Elmer 4 80 thermal cycler. 

Parameters for the 40-cycle PCR were as follows: denaturation 
at 94°C for 30 sec, annealing at 40°C for 120 sec and 
extension at 72 °C for 30 sec. Following the PCR, DNA loading 
buffer (GenHunter) was added to 4 pi of each reaction mixture 

15 and heated to 80°C for 2 min. The radiolabelled PCRamplified 
cDNA fragments were then resolved on a 6% sequencing gel run 
at 55 W constant temperature. The gels were dried without 
fixation onto Whatman 3M filter paper and exposed to Kodak 
XAR-5 film for approximately 1-3 days at room temperature. , 

2 0 cDNA fragments which were shown to be clearly up- regulated in 

the presence of homocysteine were subsequently confirmed by 
repeat differential display reactions using total RNA isolated 
from new cell cultures treated as described previously. 

25 Recovery and Reamplif ication cDNA fracrments 

cDNA fragments showing reproducible up -regulation 
were excised from the differential display gels using a 
sterile scalpel blade. Gel slices were then placed in 100 fil 
of water for 10 min and heated to 100°C for 15 min. The tubes 

30 were spun in a microcentrifuge at 14,000 X g for 2 min and the 
supernatant transferred to a new microfuge tube. The cDNA 
fragments were ethanol precipitated in the presence of 
glycogen, washed with 85% ethanol and resuspended in 10 /il of 
water. For reamplif ication, 4 /xl of each sample was used. 

3 5 PCR conditions were as described above using the appropriate 

primer sets and conditions except that the dNTP concentration 
was 20 /iM and no a-S 35 dATP was present . The amplified cDNA 
fragments were subsequently purified from 1% agarose gels 
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using the Qiaex gel extraction kit (Qiagen, Chatsworth, CA) 
and used directly as probes on Northern blots . 

Northern Blot Analysis 

DNase-free total RNA (10 fig) from control and 
homocysteine- treated cells were fractionated on 2.2M 
f ormaldehyde/1 . 5% agarose gels and transferred overnight onto 
Zeta- Probe GT nylon membranes (Bio-Rad, Toronto, ON) in 50 mM 
NaOH. The RNA was crosslinked to the membrane using a 
Stratalinker UV crosslinker (PDI Bioscience, Toronto, ON) 
prior to hybridization. Specific probes were generated by 
labelling the reamplified or cloned cDNA fragments with t<*- 
P 32 ] -dCTP (ICN, Mississauga, ON) using a random primer DNA 
labelling kit (Boehringer Mannheim, Laval, QC) . After 
overnight hybridization at 42°C, the filters were washed twice 
in 2XSSC/0.1% SDS, followed by two washes at 55°C for 15 min 
in O.lXSSC/0*l% SDS. The membranes were then sealed in 
plastic wrap and exposed to Kodak X-Omat AP film at -80°C for 
approximately 1-3 days. Hybridization of the same blot with a 
partial cDNA fragment of human glyceraldehyde 3 -phosphate 
dehydrogenase { GAPDH ) was used as a loading control. The 
quantification of changes in the expression of the 
homocysteine-dependent transcripts was performed using an 
UltroScan XL laser densitometer (Pharmacia, Bale d'Urfe, QC) . 

Cloning and Sequencing of cDNA Fragments 

cDNA fragments which showed differential expression 
on Northern blots were then subcloned into T-ended pBluescript 
<KS) and sequenced using a modified T7 polymerase system 
(Sequenase Version 2, USB, Cleveland, OH) with either T7 or T3 
primers . The cDNA sequences were analyzed for homology with 
the EMBL and GenBank DNA databases using the DNASIS DNA 
sequencing analysis system (Hitachi, Brisbane, CA) . 

The foregoing description of the preferred 
embodiments of the present invention has been presented for 
purposes of illustration and description. They are not 
intended to be exhaustive or to limit the invention to the 
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precise form disclosed, and many modifications and variations 
are possible in light of the above teaching. 

Such modifications and variations which may be 
apparent to a person skilled in the art are intended to be 
within the scope of this invention. 



WO 97/25440 PCT/C A97/0000 1 

74 

CLAIMS : 

1 . A method for diagnosing an individual having 
hyperhomocysteinemia and/or a predisposition to developing 
premature atherosclerosis, comprising: 

identifying, from a pool of individuals, an 
individual having elevated levels of serum homocysteine by 
determining the relative expression level of at least one 
homocysteine -regulated mRNA or protein encoded therein in a 
cellular sample obtained from the individual, whereby a 
relative expression level of the homocysteine -regulated mRNA 
ox protein is in a predetermined pathognomonic range and the 
individual is thereby identified as an atherosclerosis disease 
candidate and a hyperhomocystenemia candidate, 

2. The method of claim 1, wherein the 
homocysteine-regulated mRNA or protein is a mRNA comprising a 
predetermined sequence of at least 3 0 nucleotides and known to 
be regulated by homocysteine. 

3. The method of claim 1, wherein the 
homocysteine-regulated mRNA or protein is a protein encoded by 
an mRNA comprising a predetermined sequence of at least 3 0 
nucleotides and known to be regulated by homocysteine. 

4 . The method of claim 1 , wherein the 
determination of the relative expression level of the 
homocysteine-regulated mRNA is performed by nucleic acid 
hybridization with a probe polynucleotide comprising a 
sequence of at least 25 nucleotides which is substantially 
identical to a complementary sequence of a mRNA predetermined 
to be regulated by homocysteine. 

5. The method of claim 1, wherein the 
determination of the relative expression level of the 
homocysteine-regulated mRNA is performed by polynucelotide 
amplification with a primer or primer set comprising a 
sequence of at least 25 nucleotides which is substantially 
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6 identical to a corresponding or complementary sequence of a 

7 mRNA predetermined to be regulated by homocysteine, 

1 6. The method of claim 1, wherein the relative 

2 expression level of the homocysteine-regulated protein is 

3 performed by an immunoassay for quantitative analysis of a 

4 homocysteine -regulated protein, wherein the immunoassay 

5 comprises an antibody predetermined to specifically bind to 

6 the homocysteine -regulated protein. 

1 7. The method of claim 1, wherein the relative 

2 expression level is determined by nucleic acid hybridization, 

3 polynucleotide amplification, or other means of quantitating 

4 relative abundance of an mRNA species, and the quantitation 

5 result determined for the homocysteine -regulated mRNA is 

6 standardized to a quantitation result in the same sample for 

7 GAPDH mRNA. 

1 8. A method for identifying polynucleotides and 

2 polynucleotide sequences of mRNAs and the corresponding cDNAs 

3 which are substantially increased or substantially decreased 

4 in cells of patients having hyperhomocysteinemia or a 

5 predisposition to homocysteine -mediated atherosclerosis, 

6 comprising: 

7 obtaining a first mRNA sample from cells exposed to 

8 hyperphysiological levels and obtaining a second mRNA sample 

9 from cells which are exposed to normophysiological levels of 

10 homocysteine or subphysiological levels of homocysteine; 

11 amplifying the first and second mRNA samples 

12 individually by PCR amplification with a 5' arbitrary 

13 primer (s) and, optionally, a 3' arbitrary primer (s) or an 

14 oligo(dT) 3' primer, under suitable reaction conditions 

15 whereby PCR amplification product (s) are labeled; 

16 electrophoresing resultant PCR amplification 

17 products on a sequencing gel or other size-discriminatory 

18 separation method and identifying discrete size fractions 
19' which are significantly increased or decreased in the sample 
2 0 of PCR products amplified from the mRNA sample obtained from 
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21 cells exposed to a hyperphysiological homocysteine level 

22 relative to the sample obtained from cells exposed to 

23 normophysiological or subphysiological homocysteine levels. 

1 9. The method of claim 8, comprising the further 

2 step of collecting, sequencing, or cloning species of the PCR 

3 amplification products which are significantly increased or 

4 decreased in the sample of PCR products amplified from the 

5 mRNA sample obtained from cells exposed to a 

6 hyperphysiological homocysteine level relative to the sample 

7 obtained from cells exposed to normophysiological or 

8 subphysiological homocysteine levels, 

1 10. The method of claim 8, wherein the cells are 

2 cultured cells . 

1 11. The method of claim 8, wherein the cells are 

2 nucleated blood cells, endothelial cells, fibroblasts, 

3 platelets, buccal cells, chorionic villi cells, or cells 

4 comprising a vascular structure such as an artery, vein, 

5 arteriole, venule, or capillary. 

1 12 . A polynucleotide obtained by the method of 

2 claim 9. 

1 13 . A polynucleotide of claim 12, wherein the 

2 polynucleotide comprises a sequence of at least 3 0 nucleotides 

3 of dhc-1. 

1 14 . A polynucleotide of claim 12 or 13 which is 

2 labeled with a detectable label. 

1 15. A polynucleotide comprising a sequence of at 

2 least 25 consecutive nucleotides which is identical to or 

3 complementary to portion of an mRNA species which is induced 

4 or repressed by homocysteine or other sulfur-containing amino 

5 acids. 
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1 16 . A polynucleotide of claim 15, wherein the 

2 polynucleotide comprises a substantially full-length cDNA 

3 sequence identical to or complementary to a mammalian mRNA 

4 species which is substantially increased or substantially 

5 decreased in relative abundance in cells exposed to 

6 hyperphysio logical levels of a sulfur-containing amino acid. 

1 17. A polynucelotide of claim 5, wherein the 

2 polynucelotide is dhc-l. 

1 18. A dhc-l antisense polynucleotides comprising a 

2 sequnece substantially identical to at least 2 5 contiguous 

3 nucleotide of the complementary sequence of the dhc-l cDNA 

4 sequence shown in Fig . and denoted SEQ ID NO : 1 . 

5 19. A composition for diagnosing a homocysteine- 

6 related pathological condition, comprising a polynucleotide 

7 probe comprising at least 25 nucleotide which are 

8 substantially identical to a homocysteine-regulated mRNA. 

1 20. A composition of claim 19, wherein the 

2 polynucleotide probe is labeled i 

1 21. A composition of claim 19, wherein the 

2 polynucelotide probe comprises at least 3 5 nucleotide which 

3 are substantially identical or substantially complementary to 

4 dhc-l. 

5 22. A composition of claim 19, wherein the 

6 polynucleotide probe is immobilized on a solid substrate or 

7 DNA probe array. 

1 23. A method for identifying individuals having an 

2 increased risk for developing premature atherosclerosis, the 

3 method comprising determining the relative abundance of a 

4 homocysteine-regulated mRNA in a cellular sample obtained from 

5 an individual as compared to a predetermined standard value 

6 and/or as compared to a cellular sample obtained from one or 

7 more normal individuals predetermined to have serum 
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8 homocysteine levels within one standard deviation of the 

9 population mean value; cellular samples having a relative 

10 abundance of said homocysteine -regulated gene which is in a 

11 predetermined pathognomonic range are thereby identified as 

12 hyperhomocystenemia candidates and/or atherosclerosis 

13 candidates. 

1 24 . A diagnostic kit for detecting a pathological 

2 condition, such as atherosclerosis and hyperhomocysteinemia, 

3 wherein the kit contains at least one polynucleotide 

4 predetermined to hybridize to a homocysteine -regulated mRNA or 

5 to prime amplification of a homocysteine-regulated mRNA. 

1 25. The use of . a polynucleotide comprising a 

2 sequence substantially identical to or complementary to at 

3 least 25 contiguous nucleotides of a predetermined 

4 homocysteine-regulated mRNA for diagnosis and treatment of 

5 disease. 
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